(0, 90, or 130 degrees), and two types of discriminative response
(correct or incorrect) were fully crossed in the within-subjects design.
Additionally, type of addition (left or right) and complexity of the
to-be-constructed stimulus (low or high) were nested within levels of
the construction facet but crossed with each other and all other facets.

Correctness, confidence, total time, and from two to four component
times were obtained on each item. Data were analyzed using regression,
analysis of variance, and correlational techniques.

In the regression analyses, latency measures were
percent of the sample failing each item. Correlation r
slopes, predicted times at maximum difficulty, and Iv
reference constructs supported the basic hypothesis
independence. In particular, speed of solving simple i.
of both level on the task and reference abilities. Speed
complex items was strongly correlated with level on the tai
highly correlated with verbal than spatial reference tests.

The major results of the means analyses were: a) there were large
facet interactions in all analyses, implying that complexity increased
nonlinearly; b) encoding time increased linearly with increases in
stimulus complexity; c) rotation time and errors increased almost linearly
over the three levels of rotation; when errors and latency were combined,
the increase was perfectly linear; d) rotation was unaffected by stimulus
complexity, but slower for constructed than nonconstructed stimuli;

e) construction was differentially affected by the number of stimuli to
be combined, the complexity of these stimuli, the location of the addition,
and, most significantly, the complexity of the to-be-constructed stimulus;
f) for match times, "yes" responses were faster on correct items and "no"
responses faster on incorrect items. Those who performed well on the
experimental task showed little increase in match time over the rotation
and construction facets, while those who performed poorly showed a linear
increase in match time over the rotation facet and a large increase on
construction items.

Relationships between individual differences in total scores, cell 
totals, and the pattern of cell means over design facets were explored 
in the correlational analyses. In general, correlations between correctness
indices and reference tests supported the construct validity of the
task and its facets, while correlations between latency indices and
reference tests suggested more about the processes or strategies subjects
were using to solve items. The major results were: a) total number
correct was strongly correlated with reference spatial tests, while 
average total time was weakly related to verbal reference tests; b) steep 
slopes were related to verbal ability for the regressions of encoding time 
on stimulus complexity; rotation time on rotation and stimulus complexity;
and construction time on construction, stimulus complexity, and type of 
addition; c) a linear decline in correctness over the rotation facet was
related to high scores on two speeded rotation reference tests (Cards and 
Figures), while a linear decline in correctness over the construction
facet was related to high scores on two construction reference tests 
(Form Board and Object Assembly). 

It was concluded that the results of the experiment supported the
hypothesis of speed-level independence and provided important insights 
into the nature of spatial ability. 
importance of individual differences in aptitude for

CHAPTER I



INTRODUCTION 

Research on aptitude for learning has entered a new era. 
Instructional studies have established that individual 
differences among learners often interact with instructional 
treatment variables (Cronbach and Snow, 1977; Snow, 1977). Much 
of this work has also underscored the need for deeper, more 
process oriented understanding of the psychological nature of 
aptitudes. Cognitive psychologists have begun the experimental 
analysis of individual differences in information processing, and 
there is now reason to hope that coordination of these lines of
work will lead to process theories of aptitude for learning from
instruction (Snow, 1976).



Previous Approaches to the Problem.

Three paradigms have dominated previous attempts to bridge
the gap between trait and process descriptions of individual
differences in aptitude.

1. Direct modeling of the cognitive task, usually with
latency as the dependent measure. Much of modern
cognitive psychology could be classified here.
Particularly relevant examples are the work of 
Clark and his associates on verbal comprehension 
(Clark and Chase, 1972), and Shepard and his 
associates on mental rotation (Shepard and Metzler, 
1971; Shepard and Feng, 1972; Cooper and Shepard,
1976). Although the bulk of this work ignores 
individual differences in performance, such need 
not be the case. The most noteworthy example is
the extensive work of Sternberg (1977) on 
analogical reasoning. In addition to examining a 
number of sources of individual differences in 
cognitive performance, Sternberg also provides for 
the possibility of modeling error as well as 
latency data. 



2. Computer Simulation. This approach is exemplified 
in the work of Simon and Kotovsky (1963) and 
Kotovsky and Simon (1973) on letter series items. 
In simplest terms, the goal of this approach is to
develop computer programs that solve various tasks
with the same relative ease that human subjects
solve similar items. The program is considered a
good model of human behavior if item solution times 
parallel those for human subjects. 

3. Correlating information processing parameters with 
aptitude constructs. This approach is exemplified 
in the work of Hunt, Frost, and Lunneborg (1973) on
the relationships between verbal and quantitative
abilities and various task parameters. Similar
work with a wider range of aptitude constructs has
been reported by Snow, Marshalek, and Lohman
(1976). Although there are a number of variants of
this technique, the goal is to find a set of
process parameters that, collectively, predict an
aptitude construct; or, alternately, a set of
aptitude constructs that predict a process 
parameter. 

Selection of Experimental Task.

Most investigations of individual differences in cognitive 
processes have attempted to model performance on a task similar 
to an established test (e.g., Sternberg, 1977; Simon and 
Kotovsky, 1963). Since individual differences on the source test
usually have a known network of relations with other mental 
tests, the investigator ordinarily does not have to establish the 
construct validity of his task. The main disadvantages of this 
approach are: a) the sources of inter-item variation must first
be "discovered," b) once enumerated, they must be experimentally
disentangled, and c) these experimental manipulations may destroy
the construct validity of the test, which was the main reason for
using it in the first place. Since the psychometrically sound
test may be a veritable hodge-podge of test facets, uncovering 
all the dimensions of inter-item variation is frequently a 
difficult endeavor. Further, once identified, it may be 
impossible to separate these dimensions experimentally. For 
example, in Paper Folding (French, Ekstrom, and Price, 1963), 
number of folds is necessarily confounded with number of holes in 
the final unfolded stimulus. 

An alternative approach is to construct a new task in which a 
limited number of design facets are varied systematically and, if 
possible, orthogonally. The disadvantage of this method is that 
the investigator must then establish the construct validity of
the task. There is no known network of relationships with other 
tests to fall back on. 

The Speed-Level Problem
Limitations of Previous Methods.

The most glaring deficit of previous research in this area 
has been the unwitting assumption that speed of performance is 
psychologically the same as power or level of performance. 

Although the distinction between speed and power is not 
synonymous with the difference between latency and correctness, 
those who model cognitive performance using reaction time data 
are usually studying speed of performance. In any experiment of 
this sort, it is necessary to "keep the error rate down" in order 
to minimize missing data. This is usually done by keeping items 
simple, thus eliminating individual differences in power. 
Ignoring power seems to have been a concession to methodological
and statistical limitations, not a deliberate choice. 

Focusing on speed of solving relatively simple items would 
not be a serious limitation if speed and power reflected the same 
psychological functions. However, examination of the literature
on the relationship between speed and power suggests that,
particularly in the area of spatial thinking, speed of solving
simple items is largely independent of power in solving more
complex items of the same type (Lohman, 1979a).
Perhaps the most serious limitation in modeling speed rather
than level of cognitive performance arises from the fact that
power in one domain correlates strongly with power in other
cognitive domains, and, more importantly, with external criteria
such as learning in academic situations. On the other hand,
speed of solving a simple task of one type is largely independent
of speed of solving other simple tasks. Further, these various
indices of speed are only slightly correlated with power in their
respective domains, or with external variables such as academic
achievement (Lohman, 1979a).

Factor Analytic Evidence.

This relationship between speed, power, and complexity
repeatedly surfaced in a recent reanalysis of the major American
factor analytic research on spatial ability (Lohman, 1979a). The
studies were reanalyzed using hierarchical factor methods and
nonmetric multidimensional scaling. Simple, highly speeded tests
invariably defined specific factors which fell in the lower
stratum of a hierarchical model or at the periphery of a two or
three dimensional nonmetric scaling representation. Further,
these sorts of factors emerged only when extremely similar tests
were included in the analysis. On the other hand, power tests
and their factors invariably fell in the upper levels of the
hierarchical model or near the center of the scaling
representations. Tests that defined these power factors were
complex, yet often dissimilar in other respects.

The most consistent factorial split was between complex,
power tests (e.g., Paper Folding and Surface Development) that
defined a factor called "Visualization" and simple, highly
speeded tests (e.g., Cards and Flags) that defined the "Spatial
Relations" factor.

Finally, an important conclusion of this review was that the
broad power factors cannot be subdivided into the various
primaries of the Thurstone (1938) or French (1951; French,
Ekstrom and Price, 1963) variety. Alternately, second order
factors that are extracted from a matrix of primary factor
correlations do not coincide with corresponding power factors.
This nonhierarchical nature of the factors most likely reflects 
the independence of speed and power in cognitive performance. 

An important implication of this is that studying speed of 
solving simple tasks sheds little light on the nature of higher 
order aptitude constructs such as "intelligence," "verbal 
ability" or "spatial ability." This is especially troublesome
for educational applications, since it is these more general,
power aptitudes which will most likely interact with educational
treatments.

The data of Hunt et al. (1973) and Snow et al. (1976)
support this argument. Both of these investigations found that 
parameters derived from various cognitive tasks were only weakly 
related to higher order aptitude constructs. While some of these 
correlations were statistically significant, none were very 
large. 



Speed-Level Studies.

different within verbal tests, within spatial
tests, etc.

3. Accuracy. Correct responses are generally faster 
than incorrect responses for moderately difficult 
to difficult items. On simple items, however, 
incorrect responses are usually faster. 

4. Correct "Yes" vs. Correct "No." Correct "yes"
responses are usually faster than correct "no"
responses, although some subjects evidence the
opposite pattern on some spatial tasks (Cooper,
1976). Further, latency for these two types of
correct responses may relate differently to
difficulty, and thus to complexity.

5. Guessing. The error variance introduced by this 
factor can seriously cloud the relationship between 
speed and power. Further, there are differences in
willingness to guess, both between subjects and
within subjects across tasks and situations. The 
effect of guessing is most pronounced in 
experiments where a yes/no response is required. 

6. Alternative Solution Strategies. Many tasks can be 
solved in more than one way. There is some
evidence that particular tests (such as the 
Guilford-Zimmerman Spatial Orientation test) are 
especially vulnerable. Items where several 
multiple choice alternatives are provided are also 
particularly suspect. 

7. Motivation. Thorndike (1926), Thurstone (1937) and 
Furneaux (1961) all agree that motivation (or 
persistence) can influence both level and speed.
Further, the relationships between motivation,
speed and power are probably not linear; one can 
literally "try too hard." 



Complexity 

It was noted that the speed-power difference in spatial 
factors is confounded with differences in test complexity. Speed 
is measured when test items are simple, power when items are
complex. But what is complexity? 

It is important to distinguish between stimulus complexity 
and processing complexity. Stimulus complexity has been defined 
in a number of ways. Garner found that judged goodness of matrix 
patterns was highly correlated with equivalence set size (Garner,
1974; Garner and Clement, 1963). Attneave (1957), Arnoult (1960) 
and Stenson (1966) found that complexity ratings of random forms 
were strongly correlated with the number of turns or points in 
the form, the ratio of the perimeter squared to the area of the
form (compactness), and the variability of the angles on the
perimeter of the form.
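
The three physical correlates of rated complexity named above can be computed directly from the vertices of a polygon. The following is a minimal sketch rather than the procedure used in the cited studies. The function name `form_complexity`, the use of the standard deviation as the index of angular variability, and the treatment of each angle as the unsigned angle between adjacent edges are all assumptions made for illustration.

```python
import math

def form_complexity(vertices):
    """Return (turns, perimeter squared / area, SD of vertex angles in degrees)
    for a simple polygon given as an ordered list of (x, y) vertices."""
    n = len(vertices)
    perimeter = 0.0
    twice_area = 0.0
    angles = []
    for i in range(n):
        x0, y0 = vertices[i - 1]          # previous vertex
        x1, y1 = vertices[i]              # current vertex
        x2, y2 = vertices[(i + 1) % n]    # next vertex
        perimeter += math.hypot(x2 - x1, y2 - y1)
        twice_area += x1 * y2 - x2 * y1   # shoelace formula term
        # Unsigned angle between the two edges meeting at the current vertex.
        toward_prev = math.atan2(y0 - y1, x0 - x1)
        toward_next = math.atan2(y2 - y1, x2 - x1)
        angle = math.degrees(abs(toward_prev - toward_next))
        angles.append(min(angle, 360.0 - angle))
    area = abs(twice_area) / 2.0
    mean_angle = sum(angles) / n
    angle_sd = math.sqrt(sum((a - mean_angle) ** 2 for a in angles) / n)
    return n, perimeter ** 2 / area, angle_sd
```

For a unit square this sketch gives four turns, a perimeter-squared-to-area ratio of 16, and no angular variability; less compact or more irregular forms give larger values on the last two indices.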

Stimulus complexity may moderate the relationship between 
speed and level only if increases in stimulus complexity produce
corresponding increases in processing complexity. Cooper (1975)
and Cooper and Podgorny (1976) claim that stimulus complexity 
does not affect the rate of mental rotation. However, they 
employed only well learned stimuli in their experiments. The 
stimulus complexity of unfamiliar stimuli may alter the rate of 
mental rotation. This would help explain why capital letters can 
be rotated at a much greater rate than other stimuli (see Cooper 
and Shepard, 1976, for other explanations of this phenomenon). 
Finally, an important but overlooked possibility is that the 
crucial aspect of stimulus complexity in many difficult spatial 
tests is not the complexity of the stimulus figure that is given, 
but rather the complexity of the figure that must be mentally 
constructed (cf. Shepard and Feng, 1972).

Processing complexity, on the other hand, has received 
relatively little attention. It is most likely a function of the 
nature, number, and diversity of processes executed during the 
solution of a task. Certain processes may be more complex than 
others. Executing a process several times may be more difficult 
than executing it once. Similarly, switching between two
different processes may be more difficult than executing the same 
process twice. Thus, the more complex spatial tests usually 
require the subject to employ several different processes, such 
as mental folding plus mental construction. Alternately, they 
may require the repeated application of one process, such as
mental rotation (e.g., as in the Guilford-Zimmerman Spatial 
Visualization test). Finally, they may require the execution of 
a relatively simple set of processes on a complex, unfamiliar 
stimulus (e.g., as in Graham and Kendall's (1948) Memory for
Designs).



Unpacking Difficulty.

The key to the resolution of the speed-power problem is in
the construction of psychologically meaningful indices of item 
difficulty. This is perhaps best accomplished by equating 
difficulty with complexity of processing requirements and then 
constructing information processing models that account for the 
various dimensions of task complexity. 

A task like Block Rotation (Shepard and Metzler, 1971)
provides a good example of how the term "item difficulty" can be
unpacked in a psychologically meaningful way. The usual 
procedure for determining item difficulty is to compute the 
proportion of the sample passing (or failing) each item. 
However, since only a handful of subjects pass the most difficult
items, the correlation between speed and power would be based on
the entire sample for the easiest item and on no subjects for the 
most difficult item. Another limitation is that these group 
statistics may not apply at the individual level. What is 
difficult for one subject may be relatively easy for another. 

At the individual level, item difficulty should be related to
both correctness and latency. Those items that the subject
solves correctly are obviously easier for him than those he
fails. Within the set of correctly answered items, it is likely
that it will take him longer to solve the items that he finds
more difficult. However, difficulty is then defined in terms of
latency, and so the two cannot be separated.

Even if the group difficulty scale were a reasonable 
approximation for most individuals, one would still be faced with 
the problem of determining why some items were more difficult 
than others. If inter-item differences in processing complexity 
were unidimensional (e.g., items differ only in the degree of
angular separation between the two stimulus figures) then there
is a good chance that one could "discover" this dimension by
examining the group difficulty scale. However, if the inter-item
complexity is two, three or four dimensional, there is little
chance that one could intuit these dimensions from an inspection
of the group difficulty scale. This becomes obvious when one
realizes that there may be interactions between the different
complexity facets, and interactions between subjects and the
various complexity facets. Further, it is highly unlikely that
all facets of complexity present in the test will be represented
in an orderly (factorial) fashion. The psychometrically sound
test may be a veritable hodge-podge of facets.

Therefore, rather than attempting to analyze a complicated 
existing test, it would appear more reasonable to start from the
other end, by constructing a multi-faceted test. For example,
one could vary the dimensionality of the stimulus figures (two
versus three); the number of rotations required, in addition to
the angular value of each rotation (as in the Guilford-Zimmerman
Spatial Visualization test); and the like.

Returning to the block rotation task, assume that item 
complexity is entirely a function of the angular separation 
between the two stimulus figures. If item difficulty is captured 
by this unidimensional scale, then the following conditions
should hold: 

1. For the group data, both mean latency for each item
type and proportion of the total group failing each
item type should be monotonically increasing
functions of complexity (i.e., angular separation).

2. For the individual data, average (within subject)
latency of correct responses for each item type
should be a monotonically increasing function of
complexity. Further, individuals should not skip
points on the scale. Thus, within the limits of
measurement error, if an individual correctly
solves item type k he must also have solved all
previous item types 1 to k-1.
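
The two conditions above can be checked mechanically once the item types are ordered by complexity. The sketch below is illustrative only: the function names are hypothetical, ties are allowed unless `strict` is set, and no allowance for measurement error is built in.

```python
def is_monotonically_increasing(values, strict=False):
    """True if values, ordered by item complexity, never decrease
    (or always increase, when strict is True)."""
    pairs = zip(values, values[1:])
    if strict:
        return all(a < b for a, b in pairs)
    return all(a <= b for a, b in pairs)

def skips_scale_points(solved):
    """True if the subject solved some item type after failing an
    easier one. `solved` holds one boolean per item type, ordered
    from least to most complex."""
    failed_easier_type = False
    for passed in solved:
        if not passed:
            failed_easier_type = True
        elif failed_easier_type:
            return True
    return False
```

The first function applies to group mean latencies, group failure proportions, or one subject's mean correct latencies. The second flags response patterns in which item type k is solved although some easier type among 1 to k-1 was failed.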

Assuming that the regression of log time on angular 
separation is linear, it is then possible to compute these 
regressions for the group and individual data. Both the slopes 
and intercepts should be positive for the group data as well as 
for each subject. 
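
The regression just described can be sketched for a single subject as an ordinary least squares fit of log latency on angular separation. This is a minimal illustration under stated assumptions: natural logarithms are used, the helper names `fit_line` and `log_time_regression` are hypothetical, and the significance tests on the slope and intercept are not shown.

```python
import math

def fit_line(x, y):
    """Ordinary least squares fit of y on x; returns (slope, intercept)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def log_time_regression(angles, latencies):
    """Regress log latency on angular separation for one subject
    (or for group means); returns (slope, intercept)."""
    return fit_line(angles, [math.log(t) for t in latencies])
```

Applying `log_time_regression` to each subject's mean correct latencies, and once to the group means, yields the slopes and intercepts whose signs are at issue here.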

If either parameter is significantly negative for any
subject, it suggests that the information processing model is
invalid for that subject. Further, the intercept must be
significantly different from zero, while the slope may be zero.
In the case of the intercept, this is because processing even the
simplest item must take some time. On the other hand, the slope
may be zero if it takes a subject as long to process a complex 
item as it does to process a relatively simple item. However, 
this also implies that the facet in question is not a meaningful 
aspect of complexity for that subject. 

The correlation between the intercepts and the lengths of the 
regression lines yields the correlation between speed of 
performing the easiest item types and the level reached on more 
difficult item types. The individual regression lines can then 
be projected so that they all extend to the point of maximum 
complexity. At this point the correlation between these (for the 
most part, predicted) latencies and the known lengths of the 
regression lines yields the best estimate of the relationship 
between speed of performing complex tasks and level of 
performance on those tasks. Projected or known latencies at 
intermediate points on the scale can also be correlated with 
level to yield intermediate values of the speed-level 
correlation.
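
The projection procedure described above can be sketched as follows. Each subject is summarized by an intercept, a slope, and a level (the length of the regression line, taken here as the highest complexity value the subject reached). The function names and the example values are hypothetical, and the Pearson coefficient is assumed as the index of correlation.

```python
import math

def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def speed_level_correlation(subjects, complexity):
    """Correlate latency at a given complexity value with level.

    `subjects` is a list of (intercept, slope, level) tuples. The
    latency is read from each subject's regression line, so it is
    a projection wherever complexity exceeds the subject's level."""
    latencies = [a + b * complexity for a, b, _ in subjects]
    levels = [level for _, _, level in subjects]
    return pearson_r(latencies, levels)

# Hypothetical subjects, for illustration only.
subjects = [(1.0, 0.010, 60), (1.2, 0.004, 120), (1.4, 0.001, 180)]
r_simple = speed_level_correlation(subjects, 0)     # latency at the intercept
r_complex = speed_level_correlation(subjects, 180)  # latency at maximum complexity
```

Because the slopes in this made-up example differ across subjects, the correlation computed at the intercept and the one computed at maximum complexity need not agree, even in sign.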

When formulated in this manner, it is obvious that the 
correlation between speed and power will remain constant over the 
range of item complexity only if there are no individual 
differences in the regression slopes. This has important 
implications both for the speed-power problem and for the 
generalizability of information processing parameters derived
from simple tasks. If individual regression slopes are parallel, 
then the relationship between speed and power is constant 
throughout the range of complexity represented in the analysis. 
Parallel regression slopes also imply that individual differences
in speed of solving simple tasks generalize to the speed of
solving complex tasks of the same type.

On the other hand, if there are individual differences in 
slopes, then speed of solving simple tasks does not generalize to 
speed of solving complex tasks. In either case, the relationship
between speed and power, whether constant (i.e., slopes constant)
or variable (i.e., slopes differ), is the crucial issue. A 
reasonable prediction in the area of spatial visualization tasks 
would be that there are individual differences in slopes, and 
that the relationship between speed and level is higher for 
complex tasks than for simple tasks. 



Varieties of Individual Differences in Spatial Ability
Correlational Studies.

Given the importance of the speed-power problem, the need for 
a method for studying it, and the relevance of factorially 
designed experiments where complexity is manipulated in a 
systematic manner, the next step is to select the design facets. 
This presumes some knowledge of the important dimensions of the 
aptitude construct in question. 

At the most basic level, spatial thinking requires (among 
other things) the ability to encode, remember, transform, and
match spatial stimuli. Factors such as Closure Speed (i.e.,
speed of matching incomplete visual stimuli with their long term
memory representations), Perceptual Speed (speed of matching
visual stimuli), Visual Memory (short term memory for visual
stimuli) and Kinesthetic (speed of making left-right, up-down
discriminations) may represent individual differences in the 
speed or efficiency of these basic cognitive processes. However, 
these factors surface only when extremely similar tests are 
included in a test battery. Such tests and their factors
consistently fall near the periphery of a scaling representation, 
or at the bottom of a hierarchical model. 

While the processes that these factors hypothetically
represent are certainly spatial in nature, they are not usually 
the referent of the term "spatial ability." While a number of 
spatial factors have been identified, only three survive a more 
rigorous examination using hierarchical factor methods (see 
Lohman, 1979a). All of the factors involve mental 
transformation. They are: 
1. Spatial Relations. This factor is defined by tests
such as Cards, Flags and Figures (Thurstone, 1938). 
It probably represents individual differences in 
the speed of mentally rotating or reflecting a 
simple visual stimulus. The tests that define this 
factor are all parallel forms of one another, and 
the factor emerges only if these or highly similar 
tests are included in the battery. 

2. Spatial Orientation. This factor appears to
involve the ability to imagine how a stimulus array
will appear from another perspective. In the true 
spatial orientation test, the subject must imagine 
that he is reoriented in space, and then make some 
judgment about the situation. There is often a 
left-right discrimination component in these tasks, 
but this discrimination must be made from the 
imagined perspective. However, the factor is 
difficult to measure since tests designed to tap it 
are often solved by mentally rotating the array 
rather than reorienting an imagined self. 

3. Visualization. The factor is represented by a wide
variety of tests such as Paper Folding, Form Board,
WAIS Block Design, Hidden Figures, Copying, etc.
The tests that load on this factor, in addition to
their spatial-figural content, share two important
features: a) they are all administered under
relatively unspeeded conditions, and b) most are
much more complex than corresponding tests that
load on the more peripheral factors. Tests
designed to measure this factor usually fall near
the center of a two dimensional scaling
representation, and are often quite close to tests
of Spearman's "g" (such as Raven Matrices or Figure
Classification) or Cattell's (1963) Gf.

In examining the type of tests that load on these three
spatial factors, two major types of mental transformation appear
to be involved. The first is mental movement. Reflecting,
rotating, folding, or simply imagining that a stimulus is moved
from one position in an array to another position, are all
varieties of mental movement.

The second type of mental transformation may be called 
construction. There are two types of construction:
reproduction (i.e., physical construction) and combination (i.e., 
mental construction). At the simplest level, reproduction is 
represented in tests like Thurstone's (1938) Copying, where the 
subject must correctly copy a stimulus design. At the next 
level, it is represented by tests like Graham and Kendall's 
(1948) Memory for Designs, where the design must be reproduced 
from memory. One important difference between these tests and 
those that load on more peripheral factors like Perceptual Speed 
or Memory Span is that the design must be reproduced, not just 
recognized, and the reproduced design must be a veridical 
representation of the stimulus. Retaining a veridical mental 
image of a design may be an important component of other complex 
spatial tasks, such as Hidden Figures (French et al., 1963).

In the mental construction tasks, on the other hand, the 
subject must actually construct a mental image, usually by 
reorganizing the stimulus in a new way. The clearest examples of 
this sort of process are tests such as Form Equations (El Koussy, 
1935) and Paper Form Board (e.g., Thurstone, 1938; French, Ekstrom
and Price, 1963). Mental construction is an important component
of many complex spatial tests. For example, in Surface
Development (French et al., 1963), the task is not simply to fold
the sides of the figure mentally (i.e., mental movement), but 
rather to construct a three dimensional figure from a two 
dimensional drawing. Similarly, in Paper Folding (French et
al., 1963), the examinee must construct new holes as he mentally
unfolds the stimulus. Finally, mental construction may take the 
form of mentally deleting parts of a stimulus, as in Match 
Problems (Guilford and Hoepfner, 1971). This may also be an 
important component of tests such as Embedded Figures (Witkin, 
Oltman, Raskin and Karp, 1971) or Hidden Figures (French et al.,
1963).



Experimental Studies of Spatial Thinking.

Until the pioneering work of Shepard and his colleagues, 
evidence on the nature of spatial transformation processes was 
limited to the speculations of factor analysts on the nature of 
their factors. Several important insights into the nature of 
spatial thinking have emerged from this work. The first 
significant finding was that the time required to determine 
whether two stimuli were identical was a linear function of the 
angular separation between them (Shepard and Metzler, 1971).
This result was replicated in later studies using alphanumeric
characters (Cooper and Shepard, 1973a, 1973b), random two
dimensional polygons (Cooper and Podgorny, 1976; Cooper, 1975),
and two dimensional stimuli that had to be mentally folded into
three dimensional cubes (Shepard and Feng, 1972). Other
important findings were: a) the function relating time to 
perform "in depth" rotations to the angular separation between 
figures was essentially the same as that for picture plane 
rotations (Shepard and Metzler, 1971); b) the time required to
prepare for an object in a specified orientation was a linear
function of the angular separation between the specified
orientation and the normal or trained orientation (Cooper and
Shepard, 1973a, 1973b); c) the rate of rotation for well learned
complex figures was essentially the same as that for simple
figures (Cooper and Podgorny, 1976); and d) when experimental
conditions were equated, there were no differences between the
rate of rotation for two and three dimensional stimuli (Podgorny,
1975). These findings place important limitations on the type of
models that may be advanced to account for the phenomenon. In
particular, Shepard and his colleagues have argued that the
internal processes and representations underlying the rotation 
task are "analog" in nature (Cooper and Shepard, 1973b; Metzler 
and Shepard, 1974; Shepard, 1975). 

Experimental evidence on the nature of mental construction 
processes is less extensive. Palmer (1974) found that the time
to synthesize a stimulus mentally was inversely related to the
"goodness" of the component parts. However, there were important
individual differences in discriminative reaction time to the 
test stimulus, indicating that some of the subjects may not have 
completely synthesized the component parts. Glushko and Cooper 
(1976) used a different paradigm to investigate mental 
construction. They asked subjects to construct a mental image of 
a composite figure on the basis of a verbal description of the 
component parts. Construction time increased with the number of 
component parts, but discriminative reaction time to a test 
figure was independent of figural complexity. 

Individual Differences in the Experimental Studies. 

Although most experimental paradigms relegate individual 
differences to the error term, individual differences in 
performance on spatial tasks have been recognized to be striking 
enough that investigators are now forced to deal with them. Even 
within small, select samples, the estimated rates of mental 
rotation for individual subjects have differed by a factor of two 
or three (Cooper and Shepard, 1976). There are important 
individual differences in discriminative reaction time to the 
test stimulus as well. Cooper (1976) (see also Cooper and 
Podgorny, 1976) found that, for some subjects, "same" responses 
were, on the average, faster than "different" responses. 
Further, the speed of "different" responses was not related to 
the similarity of the test stimulus to the standard. For other 
subjects, "different" responses were generally faster than "same" 
responses, and "different" reaction time decreased as the 
standard and test stimuli became increasingly dissimilar. Cooper 
(1976) speculates that these patterns may reflect important 
differences in the nature of the comparison processes. 

Finally, there are important individual differences in the 
ability to solve the more difficult spatial tasks in the first 
place. For example, of the ten Stanford undergraduates who 
participated in the Shepard and Feng (1972) experiment, three had 
error rates so high (16-29%) that their data were discarded. 
Further, the correlation (for the seven remaining subjects) 
between the number of errors and mean reaction time on the 40 
error free items was -.28, which is consistent with the previous 
argument here regarding the relative independence of speed and 
level. 


Solution Strategies. 

Individual differences in solution strategy are a major 
problem for both experimental and correlational studies of 
spatial thinking. While some subjects solve verbal problems such 
as anagrams, syllogisms, and three term series problems using a 
predominately spatial strategy, it is possible to construct 
verbal tasks where spatial strategies would be of little or no 
assistance (e.g., a simple vocabulary test). On the other hand, 
it is extremely difficult to devise spatial tasks that cannot be 
solved by some nonspatial strategy. 

Further, subjects often change strategies as they become more 
familiar with the task, or as item difficulty increases (Meyers, 
1958; Barrat, 1953; Lohman, 1977a). Careful selection of the 
experimental task and open response format can eliminate some 
strategy shifts. Although retrospective and introspective 
reports are often of dubious validity, they can be obtained, and 
may suggest at least major individual differences in strategy or 
radical shifts in strategy over items. 

However, relationships between individual differences in 
patterns of scores over facets and reference constructs can 
provide more direct evidence on the nature of overall strategies 
and shifts in strategy over items. For example, the correlation 
between the pattern of latency cell means over a rotation facet 
and reference variables may suggest that steep slopes are related 
to verbal ability and, by implication, verbal strategies. Other 
mean contrasts may be formulated to address both general and 
specific hypotheses about strategies, such as: 

1. Do changes in difficulty effect overall changes in 
speed-accuracy tradeoff? 

2. What is the relationship between individual 
differences in speed-accuracy tradeoff, overall or 
over a particular facet, and reference variables? 

3. What are the relationships between individual 
differences in the patterns of cell means for 
latency or correctness over a particular facet and 
reference variables? 

Examination of the relationships between individual 
differences in the patterns of scores over design facets and 
reference tests can provide a powerful method for exploring 
intra- and inter-subject variation in solution strategy (Calfee, 
1976). 



Methodological Issues 

Limitations of Tests and Factors. 

Since correlations between task parameters and reference 
tests and factors constitute a major portion of the results of 
this experiment, some comments on the advantages and limitations 
of such data are in order. While it is important to demonstrate 
that performance on the experimental task is related to 
performance on reference tests or factors, such correlation (or 
lack of it) cannot be considered the final arbiter of construct 
validity. There are good tests and bad tests. Many tests, and 
especially instruments intended only for research (e.g., French, 
Ekstrom and Price, 1963) are not carefully constructed, even by 
psychometric standards. Further, most, if not all, test items 
can be solved in more than one way. This is particularly true of 
spatial tests. For example, only seven subjects in this 
experiment reported that they always rotated the stimuli on the 
Cards and Figures tests. Five reported that they never rotated, 
while most (13) reported rotating some stimuli and using logical 
cues to solve other items. Data were unavailable for six 
subjects. Thus, it is unreasonable to expect large correlations 
between performance on these tests and performance on rotation 
items in the experimental task. 

Similarly, factors representing common covariation in 
clusters of tests can be shadow or substance. Some of the more 
specific factors (e.g., Spatial Relations, Closure Speed, 
Perceptual Speed) are the product of including what amounts to a 
parallel forms reliability coefficient in the matrix of test 
intercorrelations. Such factors regularly disappear when the 
tests are altered to remove method covariance. Further, factor 
analysis assumes that subjects are not solving tests in different 
ways. Correlations and factor patterns can change drastically 
when data are analyzed within distinguishable strategy groups 
(French, 1965; Lohman, 1979a). 

Thus, correlations between total scores, cell totals, or 
individual contrast scores and reference tests can provide useful 
but limited evidence on the construct validity of the score or 
contrast. The importance of reference tests lies in the link, no 
matter how weak or uncertain it may be, with the network of 
individual difference constructs established through fifty years 
of correlational research on human abilities. 

Two more specific limitations of the reference test 
correlations should also be noted. First, the 30 subjects who 
participated in this experiment were deliberately selected to 
represent a wide range of verbal and spatial abilities. 
Correlations between task parameters and reference tests could be 
quite different in a more restricted sample. 

Second, subjects apparently interpreted the instruction to 
"solve each item as quickly as you can" in quite different ways. 
While the type of analysis performed provided considerable 
insight into the effects of speed-accuracy tradeoffs, the results 
of the experiment could change substantially if subjects were 
somehow induced to perform as rapidly as they could without 
sacrificing accuracy. 

Finally, it must be emphasized that this experiment was 
designed to investigate the speed-level hypothesis. Thus, even 
though considerable evidence on the nature of individual 
differences in spatial thinking emerged from the analysis, the 
study was not designed to investigate such hypotheses rigorously. 

Contrast Correlations 

When subject-factor interactions are large, plots of mean 
scores may be misleading. This is especially true when the 
subject-factor interaction is large and significant, and the 
factor itself is nonsignificant. 

Subject variability in the dependent measure over a 
particular design facet may be examined by computing contrast 
scores for each subject and then correlating these scores with 
overall performance and reference tests (Calfee, 1976). Examples 
of the procedure for computing individual contrast scores are 
presented elsewhere (see Lohman, 1979b). 
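
As an illustration of the general idea only, and not of the specific procedure given in Lohman (1979b), a linear contrast score over a three level facet can be sketched as follows. The weights (-1, 0, 1), the subject labels, the latency cell means, and the reference scores are all hypothetical.

```python
from math import sqrt

# Linear contrast weights over three ordered levels of a facet
# (assumed here; other orthogonal codings are equally possible).
LINEAR = (-1, 0, 1)


def contrast_score(cell_means, weights=LINEAR):
    """Weighted sum of one subject's cell means over the facet."""
    return sum(w * m for w, m in zip(weights, cell_means))


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Hypothetical mean latencies (seconds) at three rotation levels.
latency = {
    "s1": (2.0, 3.0, 4.0),
    "s2": (2.5, 3.0, 3.5),
    "s3": (1.5, 3.5, 5.5),
    "s4": (2.0, 2.5, 3.0),
}
# Hypothetical reference test scores for the same subjects.
reference = {"s1": 50, "s2": 60, "s3": 40, "s4": 65}

scores = {s: contrast_score(m) for s, m in latency.items()}
subjects = sorted(latency)
r = pearson([scores[s] for s in subjects], [reference[s] for s in subjects])
print(scores, round(r, 2))
```

Each subject thus contributes one number per contrast, and it is the correlation of these numbers with the reference tests, rather than the plot of group means, that carries the individual difference information.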
Conclusions. 

Previous research on the nature of aptitude processes has 
ignored the important differences between speed and level of 
performance. In particular, the speed of correctly solving 
simple items is largely independent of level attained on more 
complex items of the same type. Failure to recognize this fact 
has produced considerable confusion in the factor analytic 
literature, and in factor models of human abilities. Likewise, 
it has resulted in constructs, models, and parameters in 
experimental psychology that are of questionable 
generalizability, particularly for process understandings of 
constructs such as "verbal ability," "spatial ability," and 
"intelligence." 

The key to the resolution of the speed-power problem is the 
construction of psychologically meaningful indices of item 
difficulty. This is perhaps best accomplished by equating 
difficulty with complexity of processing requirements, and then 
constructing information processing models that account for the 
various dimensions of task complexity. 

Within the area of spatial abilities, the two major 
complexity facets are construction and mental movement. 
Construction may be divided into physical construction (i.e., 
reproduction) and mental construction. Mental movement includes 
rotation, reflection, folding, and transposition. Complex 
spatial tasks usually require more than one of these processes, 
such as folding and mental construction. Other complex spatial 
tasks require either the repeated application of a single process 
(e.g., mental rotation), or the execution of one process on a 
complex, unfamiliar stimulus. 



Overview of the Study. 

The present experiment was designed to explore individual 
differences in spatial ability, with particular emphasis on the 
interrelationships of speed, level and complexity, and their 
relationships with aptitude reference constructs. Complexity was 
represented in two ways: stimulus complexity and processing 
complexity. Stimulus complexity was determined by the number of 
points on both the given and to-be-constructed stimuli. 
Processing complexity was manipulated by increasing the "amount" 
of a particular operation (i.e., degrees of rotation or number of 
stimulus pieces that are added together) as well as the variety 
of operations (construction and rotation alone versus 
construction plus rotation). 

Mental movement was represented by mental rotation, in the 
tradition of Shepard and his colleagues (Shepard and Metzler, 
1971; Cooper and Podgorny, 1976). Construction was represented 
in two ways: a) by requiring the subject to combine two or three 
stimulus figures mentally, and b) by asking him to draw the 
combined, rotated, or combined and rotated stimulus figure. 

The major hypotheses of the investigation were: 

1. Speed of performing simple spatial tasks is largely 
independent of the level of such tasks that a 
subject can ultimately solve. 

2. Level scores correlate more strongly than 
corresponding speed scores with higher-order 
aptitude constructs such as G, Gf, and Vz. 

The present study also provided an opportunity to study the 
relationships between individual differences in the patterns of 
latency and correctness cell means over design facets and 
reference aptitude constructs. 

CHAPTER II 
DESIGN AND PROCEDURE 



Design 

There were four major components in the design of this 
experiment: the experimental task, the draw trials, the simple 
reaction time control, and the reference constructs. 

Experimental Task 

The design of the experimental task is shown in Figure 2.1. 
All design facets were fully crossed except level one of the 
construction facet, which was not crossed with type of addition 
or product complexity. 

Levels of the construction facet refer to the number of 
stimulus figures presented during a trial. When more than one 
stimulus was presented, later stimuli were to be added to the 
initial stimulus in a prescribed manner to construct a new 
stimulus. 

Levels of the rotation facet indicate the number of degrees 
through which the subject was required to rotate the given or 
constructed stimulus. When both construction and rotation were 
required on a given trial, construction always preceded rotation. 

Stimulus complexity refers to the average number of points 
on the stimulus figures presented during a trial. The levels 
were low (3-4 points), medium (5-6 points), and high (7-8 
points). 

Match refers to whether a correct or incorrect test stimulus 
was presented at the conclusion of the trial. 

Type of addition refers to whether the to-be-added stimuli 
were added to the right (level 1) or the left (level 2) of the 
initial stimulus. 

Product complexity refers to the number of points on the 
constructed stimulus. Again, low meant 3 or 4 points and high 7 
or 8 points. 
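
The crossing structure described above can be checked by enumerating the design cells. The sketch below is illustrative only: the tuple layout and level labels are assumptions, rotation levels of 0, 90, and 180 degrees are assumed, and the figure of four observations per level-one cell is an inference from the 216 experimental trials and the single observation per cell at construction levels 2 and 3 reported in the section on assignment of items to blocks, not a value stated in this section.

```python
from itertools import product

ROTATION = (0, 90, 180)                      # degrees (assumed labels)
STIMULUS_COMPLEXITY = ("low", "medium", "high")
MATCH = ("correct", "incorrect")
ADDITION = ("right", "left")                 # type of addition
PRODUCT_COMPLEXITY = ("low", "high")

# Level one of construction: a single stimulus, so type of addition and
# product complexity do not apply (recorded here as None).
level_one = [
    (1, rot, stim, match, None, None)
    for rot, stim, match in product(ROTATION, STIMULUS_COMPLEXITY, MATCH)
]

# Levels two and three of construction: all facets fully crossed.
higher_levels = list(
    product((2, 3), ROTATION, STIMULUS_COMPLEXITY, MATCH,
            ADDITION, PRODUCT_COMPLEXITY)
)

cells = level_one + higher_levels

# One observation per cell at levels 2 and 3; the replication count at
# level one is inferred from the reported total of 216 trials.
TOTAL_TRIALS = 216
level_one_reps = (TOTAL_TRIALS - len(higher_levels)) // len(level_one)

print(len(level_one), len(higher_levels), len(cells), level_one_reps)
```

On these assumptions the design has 18 level-one cells and 144 cells at levels two and three, which is consistent with the reported number of experimental trials.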

Since the entire design was replicated within each subject, 
subjects may be considered the seventh fully crossed factor. 

Figure 2.1. Design of the experimental task. 

Draw Trials 

The draw trials were chosen by taking a one quarter fraction 
of a 2 factorial design in which the match facet was eliminated 
and all three level factors were converted to two level factors. 
This was accomplished by ignoring level one of the construction 
facet, defining a rotate and no rotate factor and ignoring level 
2 (medium complexity) on the stimulus complexity factor. 

The quarter fraction of this design defined 16 cells. One 
item was constructed for each no rotation cell and two for each 
rotation cell, one each for 90 and 180 degree rotations. 
Further, magnitude of rotation was counterbalanced with the four 
values of product complexity (3, 4, 7, 8) within each level of the 
construction facet. 

In addition to these 24 items (eight from the no rotation 
cells and 16 from the rotation cells), six items were constructed 
to represent the crossing of rotation and stimulus complexity at 
level one of the construction facet. The complete set of draw 
trials thus contained 30 items. 



Simple Reaction Time Control 

This condition consisted of three consecutive blocks of 20 
true-false color discrimination trials. The ten true and ten 
false trials were separately randomized within each block. 
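
One way to generate such a trial sequence is sketched below. It assumes that "separately randomized within each block" means an independent random ordering of the ten true and ten false trials for each block; the function name and the use of a seeded generator are illustrative choices, not part of the original procedure.

```python
import random


def make_control_blocks(n_blocks=3, n_true=10, n_false=10, seed=None):
    """Return a list of blocks, each a shuffled list of True/False trials."""
    rng = random.Random(seed)
    blocks = []
    for _ in range(n_blocks):
        trials = [True] * n_true + [False] * n_false
        rng.shuffle(trials)  # a fresh, independent ordering for each block
        blocks.append(trials)
    return blocks


blocks = make_control_blocks(seed=0)
for i, block in enumerate(blocks, start=1):
    print(i, "".join("T" if t else "F" for t in block))
```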

Reference Constructs 

Since the major goal of the study was to examine individual 
differences in spatial thinking, the selection of reference 
constructs constituted an important aspect of the overall design. 
Tests representing general intelligence, Gf, Gc, Visualization, 
Perceptual Speed, Closure Speed and Memory Span were already 
available for these subjects. Additional tests were chosen to 
tap particular aspects of individual differences in spatial 
thinking not clearly represented in any of the previous reference 
tests. Cards (French, Ekstrom and Price, 1963) and Figures 
(Thurstone, 1938) were chosen to represent the Spatial Relations 
factor. Copying (French, Ekstrom and Price, 1963) and Memory for 
Designs (Graham and Kendall, 1948) were chosen to represent 
rudimentary aspects of spatial construction (see Chapter I). A 
complete list of the reference battery, with a brief description 
and example item from each test, is reported in Lohman (1979b). 



Assignment of Items to Blocks 

The experimental task was organized into five blocks. The 
first block contained the 60 reaction time control trials. The 
next four blocks were composed of experimental and draw trials. 
The 216 experimental trials were first randomly assigned to one 
of the four blocks under the constraint that all main effects be 
kept independent of block. Since there was only one observation 
per cell at levels 2 and 3 of the construction facet, some 
confounding of interaction vectors and block was inevitable. The 
degree of confounding was determined by correlating vectors 
representing the linear and quadratic components of blocks (in 
the order administered to all subjects) with the design vectors. 
For both linear and quadratic block vectors only five 
correlations with design vectors were greater than .20. For the 
linear component, only one correlation was above .25 (KRST2, 
r=.27). For the quadratic component, two correlations were above 
.25 (RSMP1, r=.27; and KRSMTP2, r=.27). Thus, even for the 
design vectors most strongly correlated with block, only 7% of 
the variance accounted for by those vectors may be attributed to 
linear or quadratic effects over blocks. 
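
The 7% figure is the square of the largest correlation, since .27 squared is about .073. The sketch below shows the form of the check, assuming the usual orthogonal polynomial codes for four equally spaced blocks; the coding actually used is not stated here, and the trial-to-block assignment and design vector shown are hypothetical.

```python
from math import sqrt

# Orthogonal polynomial codes for four blocks (an assumed coding).
LINEAR_BLOCK = {1: -3, 2: -1, 3: 1, 4: 3}
QUADRATIC_BLOCK = {1: 1, 2: -1, 3: -1, 4: 1}


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def shared_variance(r):
    """Proportion of variance shared by two vectors correlated r."""
    return r * r


# Hypothetical example: eight trials, their blocks, and one design vector.
block_of_trial = [1, 1, 2, 2, 3, 3, 4, 4]
design_vector = [-1, -1, 1, -1, 1, -1, 1, 1]

linear = [LINEAR_BLOCK[b] for b in block_of_trial]
quadratic = [QUADRATIC_BLOCK[b] for b in block_of_trial]

r_linear = pearson(linear, design_vector)
r_quadratic = pearson(quadratic, design_vector)

print(round(r_linear, 3), round(r_quadratic, 3))
print(round(shared_variance(0.27), 3))  # the largest reported correlation
```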

The 30 draw trials were also randomly assigned to one of the 
four blocks, for totals of 62, 61, 61, and 62 items in blocks one 
through four respectively. Items were then randomly ordered 
within each block before filming. 

Subjects 

Subjects were selected from an initial reference population 
of 241 Palo Alto high school students and 123 Stanford University 
undergraduates. A large reference battery was administered to 
these subjects in the Spring of 1976. Most had participated in 
other experiments conducted by the Aptitude Project. Of the 
original 171 males, 158 were available for the present 
experiment. Those who had participated in similar experiments 
were eliminated. Thirty-one subjects were then selected from the 



battery. The first spatial composite was formed by normalizing 
and then summing scores for Paper Folding, Surface Development, 
Hidden Figures (all from French, Ekstrom and Price, 1963), Block 
Design, Picture Arrangement (both from Wechsler, 1955) and the 
advanced Raven Progressive Matrices (Raven, 1962). This 
composite is akin to Cattell's (1963) Gf or Zimmerman's (1953) 
Vz. The second spatial composite was formed by normalizing and 
then summing each subject's scores on Paper Form Board (French, 
Ekstrom and Price, 1963) and Object Assembly (Wechsler, 1955). 
Although the factor analytic evidence for the separation of these 
two tests from those in the first composite was weak, both tests 
were hypothesized to involve a form of mental construction that 
could be important in the present experiment. A scatterplot of 
scores on the two spatial composites for the available population 
of males is shown in Figure 2.2. Subjects selected for the 
present experiment are represented by squares in this figure. 
The correlation between the two composites was .68. 

The verbal composite was formed by summing normalized scores 
for an abbreviated version of the Terman Concept Mastery Test 
(Terman, 1950; however, see Lohman, 1979b), Information, 
Comprehension, Similarities and Vocabulary (all from Wechsler, 
1955). 
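
The formation of a composite can be sketched as follows. The sketch assumes that "normalizing" means converting each test to z scores (mean zero, unit standard deviation) before summing; if a normalizing transformation of the score distributions was intended instead, the first step would differ. The raw scores shown are hypothetical.

```python
from statistics import mean, stdev


def z_scores(values):
    """Standardize a list of raw scores to mean 0 and SD 1."""
    m, s = mean(values), stdev(values)
    return [(v - m) / s for v in values]


def composite(tests):
    """Sum standardized scores across tests, subject by subject.

    `tests` maps a test name to a list of raw scores, with subjects in
    the same order in every list.
    """
    standardized = [z_scores(scores) for scores in tests.values()]
    return [sum(column) for column in zip(*standardized)]


# Hypothetical raw scores for three subjects on the two tests that make
# up the second spatial composite.
raw = {
    "Paper Form Board": [10, 20, 30],
    "Object Assembly": [5, 7, 9],
}
composite_scores = composite(raw)
print(composite_scores)
```

Standardizing before summing gives each test equal weight in the composite regardless of its raw score scale.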

A scatterplot of scores on the first spatial composite and 
the verbal composite for the 158 males is shown in Figure 2.3. 
Again, subjects participating in the present experiment are 
represented by squares. 

As can be seen in Figures 2.2 and 2.3, the subjects who 
participated in this experiment were quite representative of the 
reference population on both spatial ability and verbal ability. 

All subjects were paid for their participation. One subject 

Figure 2.2. Scatterplot of scores on the two spatial composites for all 158 males in the reference 
population. 

Figure 2.3. Scatterplot of scores on the first spatial and verbal composites for all 158 males in 
the reference population. 

Overview of Trial Types 

Trials differed in the number, variety and magnitude of 
transformations required, the type of test probe, and the 
complexity of the given or constructed stimulus. Ignoring for 
the moment type of test probe, stimulus complexity and product 
complexity, the major variants of trial types are shown in Figure 
2.4. For purposes of illustration, all test stimuli are correct. 

As is shown in Figure 2.4, from four to six frames were 
presented on each trial. The first and last frames were always 
"ready" and "confidence" respectively. The first stimulus was 
presented on the second frame, and the test stimulus or "draw" 
frame immediately preceded the "confidence" frame. 

Column one in Figure 2.4 shows the sequence of events when 
no rotation or construction were required: first the "ready" 
frame, next the presentation of the base stimulus, then the test 
probe, and finally the confidence frame. 

Columns two and three in Figure 2.4 show the sequence of 
frames for 90 and 180 degree rotations. Type one addition is 
shown in columns four and five, and type two addition in columns 
six and seven. Note that when two stimuli are to be added on the 
second frame, the stimulus on the left must be added first. Also 
note that the "+" sign indicates the location of the previous 
stimulus. Two examples of the four possible combinations of both 
construction and rotation are shown in columns eight and nine. 
Finally, column ten presents an example of a draw trial. Draw 
trials were the same as any other trial except that an 
instruction to draw the resulting figure replaced the test probe. 



Figure 2.4. Examples of the major trial types in the experimental task. 



Construction of Stimuli 

Seven sets of 75 stimuli were obtained by randomly selecting 
and then plotting three to eight pairs of points in each of seven 
different 3x3 to 8x8 grids. The points were then connected using 
Method 1 of Attneave and Arnoult (1956). An informal pilot 
study with these stimuli revealed that most were far too 
difficult to remember or manipulate, especially when attempting 
to combine them mentally. In particular, irregular forms were 
much more difficult than regular or symmetric forms. 
Accordingly, irregular stimuli were "normalized" by making almost 
parallel lines parallel; nearly symmetric lines symmetric; 
angles and lines that were nearly equal, equal; and angles that 
were nearly 90 degrees, right angles. Four to eight point 
stimuli without at least one right angle, or two parallel or 
symmetric lines, were then eliminated. Each figure was then 
oriented so that either one line or the major axis was vertical 
or horizontal. Examples of stimuli from each of the three levels 
of complexity are shown in Figure 2.5. Since all three point 
stimuli are triangles, stimulus complexity was necessarily 
confounded with stimulus familiarity (or uniqueness) in the 
design. Every effort was made to avoid using the same stimulus 
more than once. Thus, even though all three point stimuli were 
triangles, there are an infinite number of different triangles. 
Some repetition of forms was unavoidable, particularly on those 
items where low complexity stimuli were combined to form a low 
complexity product. Incorrect test stimuli were formed by 
reflecting or incorrectly rotating either the entire stimulus or 
one of its component stimuli. 



Construction of Construction Items 

Construction items were developed by choosing a target 
stimulus of the required complexity (low or high), and then 
cutting it into the appropriate number of pieces of average 
complexity equal to that required by the cell (low, medium or 
high). Low, medium, or high complexity stimuli could be combined 
to form either a low or high complexity product. Note that the 
level of stimulus complexity was defined as the average number of 
points on the two or three to-be-combined stimuli. 

Apparatus 

All stimuli were photographed on Kodak ASA ^4 0 Super 8 film 
for presentation on a single frame Super 8 projector. Stimuli 
appeared as black silhouettes on a white background. Rotation 
arrows and construction "+" signs were also black on a white 
background. Test stimuli and the "draw" instruction were 
silhouetted on a yellow background. The background on the ready 
and confidence frames was red. The film was projected on a 21 x 
28 cm rear screen. Average stimulus figure size within this 
field was approximately 8 cm. The distance between subject and 
screen was approximately 60 cm. Subjects controlled the 
presentation of stimuli by pressing either the hand held "true" 
and "false" buttons, or one of the four confidence buttons 
mounted on a small keyboard in front of the subject. 

Pressing any one of these buttons would cause the projector 
to advance to the next frame. A South West Technical Products 
6800 microprocessor recorded which key had been pressed, the 
elapsed time on the frame to the nearest hundredth of a second, 
and then advanced the projector to the next frame. The change 
from one frame to the next was virtually instantaneous. 

Procedure 

Each subject participated in two sessions, each lasting 
approximately two hours. In the first session, subjects were 
given a general orientation to the experiment. The four 
reference tests were then administered in the following order: 
Cards (four minutes/part), Copying (three minutes/part), Figures 
(eight minutes total), and Memory for Designs. The 15 stimuli in 
the Memory for Designs test were drawn on three by five inch 
cards and presented individually for five seconds. Subjects then 
drew each design after it was removed. They were told that the 
designs would be scored both for accuracy and correct 
proportions. 

The 60 simple reaction time control trials were given next. 
Each trial began with a red background "ready" slide which 
remained on the screen for 4.6 seconds. Either a yellow or white 
slide then appeared. Subjects were told to answer the question 
"Is it yellow?" as rapidly as possible by pressing one of the 
hand held buttons. Subjects pressed the "yes" button with their 
preferred hand. Each subject was first given 10 practice trials 
and then the 60 test trials. 

After a ten minute break, 11 demonstration experimental 
trials were given. Subjects were told to work as rapidly as 
possible but to be sure they had a clear image of the stimulus 
before advancing to the next frame. They were told that a yellow 
background test probe could occur at any point during the item. 
Further, they were told that if they took longer than one second 
to respond to the test probe they would not receive credit for 
the item. 

The demonstration was followed by nine practice items. On 
both practice and demonstration items, questions, important 
distinctions and errors were explained by having the subject 
physically combine and rotate duplicate cardboard stimuli. 

Finally the first block of experimental trials was 
presented. Subjects were asked to rate their confidence in each 
answer on a four point scale (zero = guessing; three = certain). 
They were asked to tell the experimenter if they accidentally 
pressed "yes" when they meant "no" or vice versa, and to press 
"no" if they could not perform a given construction or rotation. 
Finally, they were told that their drawings would be scored for 
both accuracy and correct proportions. 

In the second session, subjects were first given ten 
practice trials from the first block, and then blocks two, three 
and four with a 10 minute break between blocks. At the 
conclusion of the experiment subjects were asked to explain how 
they were solving the items. They were then thanked and paid 
three dollars per hour for their participation. 

CHAPTER III 

RESULTS AND DISCUSSION 



Overview 

This chapter reports the major results of the experiment. 
It is divided into six sections. The first section reports 
preliminary analyses on the formation of the various dependent 
measures, removal or replacement of outliers, computation of 
individual guessing scores, and the formation of a new index of 
correctness. 

The second section reports a series of regression analyses 
on the major speed-level hypothesis. These analyses ignore the 
design facets and examine the relationships between speed and 
level using a psychometric index of item difficulty. 

The third section reports the separate means analyses and 
correlational analyses on total time, correctness, and 
confidence. Here, the effects of the design facets on latency 
and correctness are examined in greater detail. The effect of 
practice on latency and correctness is first examined briefly. 
Then, the problem of reaction times for incorrect items is 
discussed and analyses on correct and all reaction times are 
reviewed. Separate analyses on total time and correctness are 
then reported in some detail. The interpretation of correlations 
between individual contrast scores and reference tests is 
discussed, and the contrast correlations for the separate 
analyses presented. The results of the total time and 
correctness analyses are then compared. 

In the fourth section, joint analyses on total time and 
correctness are presented and compared with the separate 
analyses. It is argued that the joint indices provide unique 
insights for a means analysis, but may distort the correlational 
analyses. 

In the fifth section, separate analyses on the encoding, 
construction, rotation, and match component times are reported. 
The relationships between component times and scores, and 
specific reference factors such as Closure Speed and Perceptual 
Speed are also discussed. 

In the sixth and final section, the major results of the 
total time and correctness analyses are compared with the results 
of the component time and correctness analyses. 



1. Preliminary Analyses 

Dependent Measures 

Three types of scores were obtained on each trial: time, 
correctness and confidence. Time on each frame was recorded to 
the nearest hundredth of a second. Total trial time was 
determined by summing the component times, excluding, of course, 
the "ready" and "confidence" frame times. 
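
As a small worked illustration of this definition, total trial time can be computed from the recorded frame times as sketched below. The frame labels and times are hypothetical; only the exclusion of the "ready" and "confidence" frames is taken from the text.

```python
# Frames whose times are not part of total trial time.
EXCLUDED_FRAMES = {"ready", "confidence"}


def total_trial_time(frames):
    """Sum component times (seconds), skipping the excluded frames.

    `frames` is a list of (label, seconds) pairs in presentation order.
    The result is rounded to hundredths, matching the recording precision.
    """
    total = sum(t for label, t in frames if label not in EXCLUDED_FRAMES)
    return round(total, 2)


# Hypothetical rotation trial: ready, stimulus, rotation, test, confidence.
trial = [
    ("ready", 1.20),
    ("encode", 2.35),
    ("rotate", 1.80),
    ("match", 0.95),
    ("confidence", 1.10),
]
print(total_trial_time(trial))
```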

Two correctness measures were obtained. The first 
represented the correctness of the subject's response to the test 
(match) stimulus. The second measure was at the component level. 
Subjects were requested to press the "no" button if they were 
unable to perform a particular component operation. 

At the conclusion of the item, subjects rated their 
confidence in their response to the test stimulus on a zero 
(guessing) to three (certain) point scale. 



Outliers 

product complexity. 

These scatterplots revealed that an arbitrary statistical 
rule for defining outliers would be unworkable. Times that were 
extremely deviant for the simplest item types were not 
necessarily extreme within the total distribution. Only .3 to .7 
percent of the scores were identified as outliers by this method. 
These scores were either eliminated, or replaced by the next 
highest time in the cell, depending on the analysis. 

Computation of Individual Guessing Scores 

On any rating scale, some subjects utilize the entire scale, 
while others concentrate on one end of the scale. Thus, a new 
variable called GUESS was calculated for each subject by 
computing the probability of a correct response at each level of 
reported confidence. Guess was then defined as the level of 
confidence where the probability of a correct response was not 
greater than .5 at an alpha of .10. 
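
The GUESS computation described above can be sketched as follows. This is a minimal illustration, not the procedure actually used: it assumes a one-sided exact binomial test of accuracy against .5 at each confidence level, and it assumes GUESS is the highest level at which accuracy is not significantly above chance. The function names, the `trials` layout, and the default `alpha` are hypothetical; the significance level is left as a parameter.

```python
from math import comb


def binom_upper_tail(k, n, p=0.5):
    """Return P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


def guess_level(trials, alpha=0.10):
    """Estimate a subject's GUESS level (illustrative assumption).

    trials: iterable of (confidence, correct) pairs for one subject,
            with confidence in 0..3 and correct as 0/1 or bool.
    Returns the highest confidence level at which the proportion
    correct is NOT significantly greater than .5, or None if accuracy
    beats chance at every level the subject used.
    """
    by_level = {}
    for confidence, correct in trials:
        n, k = by_level.get(confidence, (0, 0))
        by_level[confidence] = (n + 1, k + int(correct))
    # A level counts as "guessing" when the one-sided test fails to reject p <= .5.
    guessing = [
        level for level, (n, k) in by_level.items()
        if binom_upper_tail(k, n) > alpha
    ]
    return max(guessing) if guessing else None
```

With a subject who is at chance for ratings 0 and 1 but accurate for ratings 2 and 3, the sketch returns 1, so only items rated 2 or 3 would count as "confidence greater than GUESS".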

A More Stringent Definition of Correctness 

Component times become uninterpretable if subjects do not 
perform the operations indicated on a given frame before 
advancing to the next frame. The random presentation of items, 
the unpredictability of the test probe, the draw trials, and the 
instruction to respond to the test probe within one second were 
all designed to minimize this problem. 

A number of different factors, including failure to perform 
component operations when requested, may result in an unusually 
long discrimination reaction time. The distribution of these 
"match" times was produced for each subject. All showed some 
degree of bimodality. The lowest point between the two peaks was 
then used to define a new variable called TMATCHN (N for Node) 
for each subject. These node times ranged from 1.6 to 3.2 
seconds, with a median of 2.0 seconds. An example of the 
distribution of match times and the location of TMATCHN for one 
subject is shown in Figure 3.1. 
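
One way to locate the low point between the two peaks of a bimodal match-time distribution is sketched below. The source does not say how the node was found (it may have been read off a plot), so the histogram binning, the `bin_width` parameter, and the function name `match_time_node` are all assumptions made for illustration.

```python
from collections import Counter
from math import floor


def match_time_node(times, bin_width=0.2):
    """Return the centre of the emptiest histogram bin lying between the
    two tallest local peaks, or None if fewer than two peaks exist."""
    bins = Counter(floor(t / bin_width) for t in times)
    first, last = min(bins), max(bins)
    counts = [bins.get(b, 0) for b in range(first, last + 1)]

    # A bin is a local peak if it rises above its left neighbour and
    # does not fall below its right neighbour.
    peaks = []
    for i, c in enumerate(counts):
        left = counts[i - 1] if i > 0 else -1
        right = counts[i + 1] if i + 1 < len(counts) else -1
        if c > left and c >= right:
            peaks.append(i)
    if len(peaks) < 2:
        return None

    # Keep the two tallest peaks and search the bins strictly between them.
    a, b = sorted(sorted(peaks, key=lambda i: counts[i], reverse=True)[:2])
    valley = min(range(a + 1, b), key=lambda i: counts[i])
    return (first + valley + 0.5) * bin_width
```

In practice the bin width would need tuning per subject; too fine a grid produces spurious peaks, and too coarse a grid hides the valley.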

These new indices were used to define an additional 
criterion of item correctness called SCORE3. An item was counted 
correct only if it was correct (i.e., SCORE = 1), if confidence 
was greater than GUESS, if there were no "no" responses to 
component operations, and if match time was less than TMATCHN. 
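
The four conditions that define SCORE3 can be written as a single predicate. The sketch below is only a restatement of the rule in code; the argument names are hypothetical, and GUESS and TMATCHN are assumed to have been computed separately for each subject.

```python
def score3(score, confidence, guess, n_component_no, match_time, tmatchn):
    """Return 1 if an item meets the stricter SCORE3 criterion, else 0.

    score          -- 1 if the response to the test stimulus was correct
    confidence     -- the subject's 0-3 confidence rating for the item
    guess          -- the subject's GUESS level
    n_component_no -- number of "no" responses to component operations
    match_time     -- match (discrimination) time in seconds
    tmatchn        -- the subject's TMATCHN node time in seconds
    """
    return int(
        score == 1
        and confidence > guess
        and n_component_no == 0
        and match_time < tmatchn
    )
```

An item answered correctly at confidence 3 with a 1.4 sec match time counts under SCORE3 for a subject with GUESS = 1 and TMATCHN = 2.0, whereas the same item with a 2.6 sec match time does not.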



Figure 3.1. Distribution of match component times and the location of TMATCHN for subject HF. 

The relationships between latency and correctness may be 
examined in two ways. The first ignores the design facets, is 
more atheoretic, briefer, and resembles the traditional 
psychometric type of analysis. The second examines the 
relationships between latency and correctness in the light of the 
design facets, assumes that these facets affect differential 
changes in performance, is more complex, but also more 
enlightening. This section reports four analyses of the first 
type. Analyses of the second type constitute the remainder of 
the chapter. 

Construct Validity of the Level Scores 

Before examining the speed-level hypothesis, it is first 
necessary to establish the construct validity of the level scores. 
Correlations between total SCORE, total SCORE3, total correct 
Draw trials, and the major reference tests are shown in Table 
3.1. Total SCORE correlated .75 with Paper Folding and .66 with 
Memory for Designs-Quality. The Draw total correlated .81 with 
Paper Folding, .77 with Surface Development, and .73 with Memory 
for Designs-Quality. Further, Draw and total correct 
intercorrelated .82. Clearly, individual differences in level on 
tests were formed. The spatial cluster was composed of Paper 
Folding, Form Board, Surface Development, Cards, Figures and 
Memory for Designs-Quality. The verbal cluster contained WAIS 
Information, Comprehension, Similarities, Vocabulary, and Word 
Beginnings and Endings. SCORE and SCORE3 were then correlated 
with the sum and the difference of these two clusters. The 
verbal plus spatial composite represents general mental ability 
(in fact, it correlated .99 with WAIS Full Scale Score), while 
Table 3.1 

Correlations between Total SCORE, Total SCORE3, Total 
Correct Draw Trials, and Reference Tests (N=30) 

Reference Test                      Total    Total    Total
                                    SCORE   SCORE3     Draw

Paper Folding                          75       57       81
Form Board                             48       45       58
Surface Development                    66       61       77
Copying                                42       32       57
Cards                                  49       41       50
Figures                                45       54       46
Memory for Designs                     39       41       53
Memory for Designs - Quality           66       63       73
Visual Number Span                     40       42       43
Identical Pictures                     39       26       50
Word Transformations                   36       25       43
Camouflaged Words                      40       04       48
Hidden Figures                         53       30       52
Word Beginnings and Endings            41       21       52
Necessary Arithmetic Operations        47       21       52
Letter Series                          67       41       55
Terman Concept Mastery                 51       27       64
Raven Progressive Matrices             45       18       49
WAIS Information                       48       26       54
Comprehension                          33       25       55
Similarities                           46       38       56
Digit Span                             51       44       36
Vocabulary                             50       24       61
Digit Symbol                           27       15       47
Block Design                           48       23       50
Picture Arrangement                    35       41       42
Object Assembly                        28       29       51
Verbal Scale Score                     53       34       64
Performance Scale Score                52       39       69
Full Scale Score                       60       41       76

Note. Decimals omitted. 
the spatial minus verbal composite represents the spatial-verbal 
distinction. If SCORE3 is a "cleaner" spatial score, it should 
correlate higher with the spatial end of the spatial-verbal 
composite than SCORE. The correlations with the sum composite 
were .73 and .64, and with the difference composite .29 and .42 for 
SCORE and SCORE3, respectively. Thus, SCORE3 is a cleaner 
measure of spatial ability than SCORE. 



Speed-Level Correlations 

Items were first rank ordered according to percent of the 
sample failing the item by SCORE3. Total item time was then 
regressed on percent failing, separately for each subject. The 
analysis was then repeated using only those times for items 
correct by SCORE3. Since items were ordered on a zero to one 
scale, the intercept provided the best estimate of speed of 
solving the simplest item types. Predicted speed of solving the 
most difficult items was estimated by simply summing the slope 
and intercept. Correlations between intercepts, slopes, 
predicted times at maximum difficulty for the two analyses, and 
reference spatial tests are shown in Table 3.2. Table 3.3 
contains a similar set of correlations, this time excluding the 
scores for one subject who had an extremely steep slope. 
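
The per-subject regression just described can be sketched as an ordinary least squares fit of item time on the proportion of the sample failing each item. Because difficulty runs from zero to one, the predicted time at maximum difficulty is simply the intercept plus the slope. The function and key names below are hypothetical, and the sketch is not the original analysis program.

```python
def fit_line(x, y):
    """Ordinary least squares fit of y on x; returns (intercept, slope)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


def speed_level_indices(prop_failing, item_times):
    """Speed indices for one subject.

    prop_failing -- proportion of the sample failing each item (0 to 1)
    item_times   -- this subject's total time on the same items
    """
    intercept, slope = fit_line(prop_failing, item_times)
    return {
        "intercept": intercept,     # estimated time on the simplest items
        "slope": slope,             # increase in time from easiest to hardest
        "t_max": intercept + slope  # predicted time at maximum difficulty
    }
```

Restricting `item_times` to items correct by SCORE3 before calling `speed_level_indices` gives the second ("correct times only") analysis.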

The most important correlations are those between the three 
variables that represent level on this task (SCORE, SCORE3, Draw) 
and the intercepts and predicted times at maximum difficulty. In 
all four analyses, the intercepts were independent of the level 
scores. On the other hand, predicted times at maximum difficulty 
were highly correlated with total SCORE and total Draw, but 
unrelated to total SCORE3. 

Examination of the correlations with reference tests explains 
this discrepancy between the SCORE and SCORE3 correlations with 
predicted times for difficult items. In all four analyses, the 
intercept correlated highest with a strategy index, number of 
pencil marks each subject made on the Form Board Test. The 
correlations were strongest in the analyses on correct times. 
When all times were included, the correlation with Closure Speed 
Table 3.2 

Correlations between Slopes, Intercepts, Predicted Times 
at Maximum Difficulty, and Reference Tests (N=30) 

Reference Test                      All Times            Correct Times Only
                              Intercept  Slope    T   Intercept  Slope    T

Paper Folding                               40   35
Form Board                          -33
Form Board - Marks                   41                      54
Surface Development                 -38     32
Copying                             -36
Cards
Cards - Wrong                               49   45                 59   53
Figures
Closure Speed                       -45
Visual Number Span                          34   39                 33   38
Identical Pictures
Object Assembly                     -40     35              -31
Memory for Designs - Quality
Word Beginnings & Endings                   47   40                 36   32
Necessary Arith. Operations                 44   39                 40   37
Terman Concept Mastery                      54   53                 50   49
Information                                 46   44                 44   43
Comprehension                               44   36                 42   35
Similarities                        -36     38              -30
Digit Span                                  46   39                 32
Vocabulary                                  54   49                 47   44
Verbal Scale Score                          58   50                 50   44
Performance Scale Score             -33
Full Scale Score                    -32     52   42                 40   34
Draw Total                                  58   53                 45   44
Total SCORE                                 56   56                 42   46
Total SCORE3

Note. Decimals and correlations less than .30 omitted. 
Table 3.3 

Correlations between Slopes, Intercepts, Predicted Times 
at Maximum Difficulty, and Reference Tests (N=29) 



Reference Test 


All 


Times 




Correct Times 


Only 


Intercept 


Slope 


T 


Intercept Slope 


T 


Paper Folding 




53 


A6 


42 


37 


Form Board 


-34 


36 








Form Board - Marks 


41 




37 


53 


30 


Surface Development 


-38 


48 


32 


33 




Copying 


-38 










Cards 












Figures 












Closure Speed 


-47 






-34 




Visual Number Span 




39 


44 


44 


47 


Identical Pictures 




40 


32 






Hidden Figures 




50 


50 


40 


41 


Word Beginnings & Endings 


48 


38 


35 




Necessary Arith. Operations 


44 


36 


42 


35 


Letter Series 




43 


47 


41 


45 


Terman Concept Mastery 




52 


48 


49 


46 


WAIS Information 




43 


41 


46 


42 


Comprehension 




30 








Similarities 


-36 


37 








Digit Span 




46 


37 






Object Assembly 


-40 










Verbal Score 




55 


43 


46 


36 


Performance Score 


-33 


37 








Memory for Designs-Quality 


61 


56 


59 


54 


Draw Total Correct 




70 


61 


62 


55 


Total SCORE 




69 


66 


59 


58 


Total SC0RE3 




40 


33 


32 





Note. Decimals and correlations less than .30 omitted. 
became significant. When the outlier was removed (Table 3.3), 
correlations between the intercept and other spatial tests 
surfaced, although none reached significance. 

The slope, on the other hand, had its highest correlations 
with verbal tests in the full sample analyses (Table 3.2). Here, 
when all times were included, slope correlated highest with WAIS 
Verbal Scale Score, Terman Concept Mastery, WAIS Vocabulary, and 
number of errors on the Cards test. Later results will show that 
Cards Wrong is actually an index of verbal strategy, not spatial 
ability. When only correct times were included in the analysis, 
slope correlated exclusively with verbal tests and Cards Wrong. 

When the outlier was removed, the correlations presented 
more of a mixed bag. Here (see Table 3.3), slope correlated 
highest with Memory for Designs - Quality, WAIS Verbal Scale 
Score, Terman Concept Mastery, and Hidden Figures. However, on 
balance, the correlations were still stronger with verbal tests 
than spatial tests. 

In sum, the major hypothesis of this investigation is 
supported by these results, but with an unexpected twist. Speed 
of solving simple items was unrelated to level on the task. 
Speed of solving difficult items was highly correlated with two 
of the level scores. However, it was expected that these 
predicted times at maximum difficulty would be negatively 
correlated with spatial reference tests. Instead, predicted 
times for difficult items were positively correlated with verbal 
reference tests. 

Subsequent analyses on correctness and latency will clarify 
these relationships between verbal ability, spatial ability, and 
speed and level scores on the experimental task. This is 
accomplished by examining the structure of cell means, through 
analysis of variance, and the relationships between individual 
differences in patterns of scores over design facets and 
reference tests, through contrast and cell total correlations. 
III. Separate Analyses on Total Time, Correctness and Confidence 

Practice Effects 

For the group data, there was a significant decrease in 
total time over the four blocks. Mean times averaged over 
subjects and items within a block were 19.46, 16.18, 14.92, and 
13.06 sec for blocks one through four, respectively. An analysis 
of variance indicated that while the linear term captured most of 
the variance (F = 255.51) the quadratic and cubic terms were also 
significant (F = 6.09, p = .0138 and F = 4.16, p = .042, 
respectively). 
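
The shape of the practice effect can be illustrated by applying the standard orthogonal polynomial coefficients for four equally spaced levels to the block means reported above. This sketch only computes the raw contrast values on the means; it does not reproduce the F tests, which require the subject-level data. That the original analysis used exactly these coefficients is an assumption.

```python
# Mean total time (sec) for blocks one through four, as reported in the text.
BLOCK_MEANS = [19.46, 16.18, 14.92, 13.06]

# Standard orthogonal polynomial coefficients for four equally spaced levels.
TREND_COEFFICIENTS = {
    "linear": [-3, -1, 1, 3],
    "quadratic": [1, -1, -1, 1],
    "cubic": [-1, 3, -3, 1],
}


def trend_contrasts(means):
    """Return the value of each polynomial trend contrast on the means."""
    return {
        name: sum(c * m for c, m in zip(coefs, means))
        for name, coefs in TREND_COEFFICIENTS.items()
    }


if __name__ == "__main__":
    for name, value in trend_contrasts(BLOCK_MEANS).items():
        print(f"{name:>9}: {value:7.2f}")
```

The linear contrast is large and negative (about -20.46), while the quadratic (about 1.42) and cubic (about -2.62) contrasts are small, which is consistent with a mostly linear decline that levels off slightly.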

At the individual level, nine subjects showed no change or 
minimal change in RT over blocks, thirteen showed a definite 
decline in RT, and eight had other patterns (such as a large drop 
between block one and two, but steady thereafter, or a gradual 
increase in RT over blocks). 

Subjects also tended to make fewer errors with practice. 
The proportion of correct responses was .75, .80, .83 and .83 for 
blocks one through four respectively. The corresponding values 
for SCORE3 were .60, .67, .72, and .72. 

RT for Incorrect Items 

When a subject fails to solve an item, the reaction time for 
that item is ambiguous. While some investigators regularly 
include both correct and incorrect reaction times in their 
analyses (e.g., Sternberg, 1977), the usual procedure is to 
exclude times from incorrect items (Pachella, 1974). However, 
this experiment differs from the classical RT study in two 
important ways: 

1. Subjects were deliberately selected to 
represent a wide range of individual differences in 
spatial ability. Using only reaction times for correct 
items would bias the results by weighting more heavily 
the performance of the more able subjects. 

2. Similarly, items were constructed in order to 
represent a wide range of item difficulty or 
complexity. An analysis of correct reaction times 
would be biased, giving more weight to the easier 
items. 

In any case, the issue of whether to include only the "clean" 
correct RT's or all RT's in an analysis is an empirical matter. 
As a preliminary check, an analysis of variance was performed on 
all reaction times (except 30 outliers, which were excluded) and 
on the subset of these times where the item was correct and 
confidence was greater than GUESS. The results are reported 
elsewhere (see Lohman, 1979b). Note that these are not repeated 
measures analyses of variance. Computer limitations precluded 
doing this type of analysis on the incomplete data matrix defined 
by the second analysis. Therefore, all subject factor 
interactions were included in the error term. Both analyses 
followed a regression model in which main effects were entered 
first, two way interactions second, etc. 

Including all times in the analysis added some noise. The 
squared multiple correlations were .352 and .419 for the complete 
and restricted data sets, respectively. A few interactions were 
significant in one analysis and nonsignificant in the other, but 
on the whole, the results were quite similar. This does not mean 
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important psychological dif 
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ect reaction times. Rather^ 
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alter the results. 
A repeated measures analysis of variance on total time is 
shown in Table 3.4. Times for both correct and incorrect items 
were included in the analysis. Thirty extreme observations were 
replaced by the next highest within cell time. 

All main effects were in the expected direction, except 
match, which was not significant. Construction, rotation, and 
stimulus complexity all had strong linear components and weak, 
but significant, quadratic components. These main effects are 
plotted in column one of Figure 3.5, below. Note also that there 
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Table 3.4 



Analysis of Variance on Total Time 



Source 






tr 

r 


Source 


A f 


no 


r 


Mean 


1 
1 


1624939 . 49 


2oJ . iU'^ 


M 

IN 


zy 


A 1 7 A 11 




Construction 


(2) 






NC 


C Q \ 

(58; 






Linear 


1 


131612.26 


127 .81* 


Linear 


29 


1029 . 76 


O O 7 O * 

jj . /2* 


Quadratic 


I 


2089. 17 


64 . 61* 


Quadratic 


29 


J2 . Jif 


1 . Uo 


Rotation 


(2) 






NR 


/ C Q \ 

(58) 






Linear 


1 


33597 . 19 


121 . 55* 


Linear 


29 


276 . 40 


A A C X 

9 . 05* 


Quadratic 


I 


2295.83 


35 . 29* 


Quadratic 


29 


^ C AC 

65 . 05 


O 1 o * 

2.13* 


Stimulus Complexity 


/ON 

(2) 






NS 


/ C Q \ 

(58) 






Linear 


I 


22628 . 14 


54 .38* 


Linear 


29 


A 1 ^ 1 O 

416.12 


1 J . 0 J* 


Quadratic 


I 


298 . 99 


6.01 


Quadratic 


29 


AO T Q 

49 . /o 


1 . 0 J 


^jMatch 
Type or Addition 


I 


23.66 


.69 


NM 


29 


j2 . Zo 


1 . IZ 


1 


2787 . 89 


47 .4 J* 


IN 1 


zy 


JO . / J 


1 . y z^ 


Product Complexity 


I 


16867 .77 


38 . 58* 




Z 5 


A O 7 1 fi 

hJ/ . lo 


1 A 09 + 

iH . JZ^ 


CR 




359. 27 


11 11^ 
1 1 . 1 




1 1 A 


jZ . 


1 . UO 


CS 




1009 . 25 




MP C 


i 10 


A A 7 ^ 


1 A 7* 


CM 


2 


15.62 


. 79 


NCM 




1 Q 7 A 
ly . / 0 


. 0 J 


KT 


I 


3003.60 


38.51* 


NKT 


29 


7Q nn 
/ o . UU 


2 . J j'^ 


KP 


I 


4026.37 


3-1 . 07* 


NKP 


29 


1 O Q ^ Q 

129 . JO 


A 0 A * 


RS 


4 


118.33 


4 0 1 * 


KRS 


1 16 


29 • 49 


Q 7 

. 9 7 


RM 


2 


276.42 


5.88* 


NRM 


58 


/ T AO 

47 . 02 


1 C / ^ 

1 . 54* 


RT 


2 


976.06 


16.76* 


NRT 


58 


58. 24 


1.91* 


RP 


2 


1006. 13 


22.32* 


NRP 


58 


/ C AO 

45 . 08 


1 . 48 


.SM 


2 


46.96 


1 .48 


NSM 


58 


O 1 T C 

31.75 


1 A/ 

1 .04 


ST 


2 


1175.67 


18 .06* 


NST 


c o 

58 


65 . 09 


2.13* 


D Jt 




1 067 99 


ID 33* 


NSP 


58 


103.38 


3.39* 


MT 


1 


632.59 


13.23* 


NMT 


29 


47.83 


1.57 


MP 


1 


1195.24 


11.80* 


NMP 




101.26 


3.32* 


1? 


1 


2834.35 


26.24* 


NTP 


29 


108.02 


3.54* 


CR« 


8 


124.37 


2.87 


NCRS 


232 


43.29 


1.42* 


CRM 


4 


378.73 


8.69* 


NCRM 


116 


43.58 


1.43* 
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566.70 
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47.59 


1.56* 
Table 3.4 (continued) 

Source     df        MS         F     Source     df        MS        F

KRT         2    806.78    23.73*     NKRT       58     34.00     1.11
KRP         2    189.15     5.39*     NKRP       58     35.07     1.15
KST         2   1093.37    16.80*     NKST       58     65.09     2.13*
KSP         2   1215.51    32.46*     NKSP       58     37.45     1.23
KMT         1    298.62     2.80      NKMT       29    106.69     3.49*
KMP         1     28.17      .55      NKMP       29     51.59     1.69
KTP         1   1446.99    15.82*     NKTP       29     91.48     3.00*
RSM         4    714.50    18.92*     NRSM      116     37.77     1.24
RST         4    408.47     8.37*     NRST      116     48.80     1.60*
RSP         4    347.32     5.01*     NRSP      116     69.37     2.27*
RMT         2   1007.53    31.00*     NRMT       58     32.50     1.06
RMP         2   2547.56    19.97*     NRMP       58    127.56     4.18*
RTP         2    261.53    10.66*     NRTP       58     24.54      .80
SMT         2    152.07     2.37      NSMT       58     64.07     2.10*
SMP         2    773.87    11.92*     NSMP       58     64.92     2.13*
STP         2    101.94     3.05      NSTP       58     33.37     1.09
MTP         1     71.89     1.38      NMTP       29     52.09     1.71
CRSM        8    212.70     6.05*     NCRSM     232     35.17     1.15
KRST        4   1325.56    16.46*     NKRST     116     80.54     2.64*
KRSP        4   1649.01    15.25*     NKRSP     116    108.11     3.54*

KRMT 


2 


142 


.35 


2.91 


NKRMT 


58 


49« 


00 


1. 
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KRMP 


2 


1370 


.03 


15.85''' 


NKRMP 


58 


86. 


42 


2. 
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KRTP 


2 
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.55 


3.94 


NKRTP 


58 


40. 


20 


1. 


32 


KSMT 


2 


281 


.82 


3.85 
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58 
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28 


2. 


40* 


KSMP 


2 
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.36 


1 .85 
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58 
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1. 
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KSTP 


2 
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.06 
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58 
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55 
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1 


45 
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29 
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67 


RSMT 


4 
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.71 


2.51 
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04 


1. 
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4 
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.25 
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80 


2. 
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4 
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.99 


3.32 
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35. 


54 


1. 


16 
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2 
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.32 


10.81 
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58 
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77 


2. 
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2 


933 


.65 


13.07* 
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58 
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45 


2. 


34* 
Table 3.4 (continued) 

Source     df        MS         F     Source     df        MS        F

KRSMT       4     53.22      .90      NKRSMT    116     59.22     1.94*
KRSMP       4    396.04     5.78*     NKRSMP    116     68.48     2.24*
KRSTP       4    433.76     8.55*     NKRSTP    116     50.71     1.66*
KRMTP       2    447.89     7.33*     NKRMTP     58     61.07     2.00*
KSMTP       2    482.70     3.63      NKSMTP     58    132.86     4.35*
RSMTP       4    796.33               NRSMTP    116     82.02     2.69*
KRSMTP      4   1237.93    11.33*     NKRSMTP   116    109.22     3.58*

Residual 1620     30.54

*p less than .01 
were many large interactions. Thus, a simple additive model does 
not fit the data well. 

All two-way interactions except CM and SM were significant. 
The eight largest of these are plotted in Figure 3.2. The 
construction by rotation interaction indicates that the time 
taken to rotate and match a constructed stimulus was slightly 
greater at 180 degrees for two piece additions than one piece 
additions. 

Although the type of addition main effect was significant, 
the construction by type (KT) interaction reveals that left 
additions took longer only for two piece additions. Similarly, 
the type of addition by product complexity (TP) interaction 
reveals that left and right additions at low product complexity 
took the same amount of time, while left additions were much more 
time consuming than right additions at high product complexity. 

The construction by product complexity (KP) interaction 
reflects the fact that high product complexity additions were 
relatively more time consuming for two piece additions than one 
piece additions. Note that in all interactions between 
construction and type of addition or product complexity, 
construction is represented by K. This is necessary in order to 
distinguish between the two level construction contrast between 
two piece additions and one piece additions (K), and the C linear 
and quadratic contrasts that span the entire three levels of the 
construction facet. Discussion of the RT and RP interactions is 
best postponed until after the contrast correlations are 
presented and correctness and latency are examined jointly. 

Figure 3.2. Plots of the eight largest two way interactions for total time. 

Objections to Contrast Correlations 

Two apparent limitations of the correlations between these 
contrast scores and reference variables merit brief comment. 
First, correlations based on only 30 cases are usually quite 
unstable. However, subjects were not selected randomly; rather 
they were deliberately chosen to be a representative sample of 
verbal and spatial ability in the larger population (N = 171 high 
school and college males). Therefore, the correlations reported 
here are much more dependable than the sample size suggests. 

Second, all contrasts except the M contrast are difference 
scores. Difference scores are known to be relatively unreliable 
(Lord, 1963; Cronbach and Furby, 1970). Five factors mitigate 
these complaints. 

1. Each contrast score is based on at least 144 
observations and as many as 216 observations for 
each subject. Thus, the means and differences 
between them should be quite stable. 

2. As Cronbach and Furby (1970) point out, linked 
errors can substantially enhance the reliability of 
difference scores. Since all observations were 
obtained from one task there necessarily are linked 
errors in these scores. Hence, the reliability of 
the difference scores is greater than would be 
estimated by conventional methods (i.e., Lord, 
1963). 

3. All reliability coefficients are merely estimates 
of score reliability. Low estimated reliability 
may help explain why a score fails to correlate 
with other variables. However, if the reliability 
estimates are low, and the correlations are high, 
then it is the reliability estimate that needs 
explaining. Further, even within the rigid 
assumptions of classical test theory, it is the 
square root of the estimated reliability that 
defines the limit in validity. Thus, if the 
reliability of a test is only .25, the test may (in 
theory) correlate .50 with another variable. 

4. It can be shown (see Lohman, 1979b) that the 
correlation between the difference score X-Y and 
another variable Z will be greater than the 
difference between r(X,Z) and r(Y,Z) if r(X,Y) is 
greater than .50. If r(X,Y) = .80, then r(X-Y,Z) 
= 1.58 [r(X,Z) - r(Y,Z)]. If r(X,Y) = .95, then 
r(X-Y,Z) = 3.16 [r(X,Z) - r(Y,Z)]. Thus, when cell 
intercorrelations are high, as in this task, 
correlations between individual contrast scores and 
reference tests may be quite high. Further, since 
the contrast score correlation with a reference 
test will be much larger than the difference 
between corresponding correlations between each 
cell and the reference test, it is much easier to 
detect differential correlations between 
performance over a task and reference tests using 
the contrast correlations. 

5. In most educational applications, the absolute 
magnitude of the gain score is frequently about the 
size of the standard error of measurement of the 
raw scores, and much smaller than the range of 
individual differences in the raw scores. However, 
the magnitude of the difference score over a 
powerful facet in a within subjects design may be 
much larger than the standard error of the raw 
scores and even larger than the range of 
individual differences in the raw scores. 
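
The multipliers quoted in the fourth point (1.58 and 3.16) follow from the general expression for the correlation of a difference score with a third variable. With standard deviations s_X and s_Y, r(X-Y,Z) = [s_X r(X,Z) - s_Y r(Y,Z)] / sqrt(s_X^2 + s_Y^2 - 2 r(X,Y) s_X s_Y). When the two variances are equal this reduces to [r(X,Z) - r(Y,Z)] / sqrt(2 [1 - r(X,Y)]). The sketch below evaluates both forms; the equal-variance simplification is an assumption made here to reproduce the quoted figures, and the function names are hypothetical.

```python
from math import sqrt


def diff_score_correlation(r_xz, r_yz, r_xy, sd_x=1.0, sd_y=1.0):
    """Correlation of the difference score X - Y with a third variable Z."""
    numerator = sd_x * r_xz - sd_y * r_yz
    denominator = sqrt(sd_x**2 + sd_y**2 - 2.0 * r_xy * sd_x * sd_y)
    return numerator / denominator


def equal_variance_multiplier(r_xy):
    """Factor applied to r(X,Z) - r(Y,Z) when X and Y have equal variance."""
    return 1.0 / sqrt(2.0 * (1.0 - r_xy))


if __name__ == "__main__":
    for r_xy in (0.50, 0.80, 0.95):
        print(f"r(X,Y) = {r_xy:.2f} -> multiplier = "
              f"{equal_variance_multiplier(r_xy):.2f}")
```

Under equal variances the multiplier equals 1 exactly when r(X,Y) = .50, which is why the contrast correlation exceeds the simple difference in correlations only above that value.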
Thus, contrast scores may be more dependable and more useful 
than is commonly assumed in the psychometric literature on 
difference scores. Nevertheless, there are three important 
limitations of contrast scores that must be noted: 

1. Even though contrast scores may be more dependable than the 
psychometric literature suggests, they are still less 
reliable than their corresponding raw scores. 

2. Ceiling and floor effects in either raw score can destroy the 
validity of the difference score. 

3. While analysis of variance procedures are relatively robust 
with respect to violations of the equal variance assumptions, 
correlations are not. Thus, correlations between contrast 
scores and other variables can be seriously biased if the 
within cell variances are not approximately equal. In 
particular, such correlations will tend to reflect the rank 
order of raw scores in the cell with the greater variance. 
Total Time Contrast Correlations 

Significant correlations between reference tests and the ten 
total time main effect contrasts are summarized in Table 3.5. 

N Contrast. The N contrast represents subject means. Mean 
total time correlated .49 with total SCORE but only .18 with 
total SCORE3. Therefore, those who take longer tend to get more 
items correct. However, when scores are corrected for guessing 
and extremely long match times, mean time and correctness become 
independent. The only other significant N contrast correlation 
was .43 with Terman Concept Mastery. Correlations with other 
verbal tests were positive, but not significant (i.e., .37 with 
WAIS Information, and .35 with Vocabulary), while correlations 
with spatial tests were all much smaller, and, of course, not 
significant. Thus, if anything, mean time appears to be more 
related to verbal ability than spatial ability. 

Construction Contrasts. The construction linear contrast 
represents the difference between average time on two piece 
additions and no addition items. Scores on this contrast were 
most consistently correlated with verbal tests and number of 
errors on the Cards test. The quadratic contrast represents the 
difference between the average of no additions plus two piece 
additions, and one piece additions. High positive scores reflect 
little change between zero additions and one addition, and a large 
increase between one and two additions. Negative scores reflect 
the reverse pattern. The contrast correlated negatively with 
Visual Number Span (r = -.45) and Hidden Figures (r = -.51) and 
positively with Marks Visual Imagery Questionnaire (r = .43). 
Therefore, those who reported vivid imagery were more likely to 
show the most change between no additions and one addition, while 
those with high scores on Hidden Figures and Visual Number Span 
showed larger increases between one and two piece additions. The 
most important finding here was the opposition between Hidden 
Figures, which is probably the most "analytic" spatial test in 
the battery, and the reported vividness of visual imagery. This 
suggests that an analytic approach to the construction task can 
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Table 3.5 

Correlations between Total Time Main Effect Contrast Scores and Reference Tests (N=30) 



M 




C Linear 




C Quadratic 




R Linear 




R Quadratic 




Total SCORE 


A9 


Cards Wrong 


— 

52 1 


Marks Imagery Q. 


A3 


Carv'S Wrong 


57 


Picture Compl. 


5A 


Terman Concept M« 


A3 


Total Draw 


A6 


Vis. Numb. Span 


-A5 














Vis. Numb. Span 


A5 


Hidden Figures 


-51 














Terman Concept M. 
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Information 


AA 


















Vocabulary 


A7 














— 




WAIS Verbal S. 


A7 














S Linear 




S Quadratic 




M 




T 




P 




WAIS IQ 


50 


WAIS IQ 


6A 


Fnnn Rnnrrl 








WAIS IQ 


60 


Paper Folding 




Paper Folding 


A7 


Cards Wrong 


65 


Paper Folding 


AS 


Paper Folding 


57 


Terman Concept M. 


AA 


Identical Picture 


53 


Vis Numb Span LC 


51 


Aud Lett Spai: LC 


A3 


Form Board 


A6 


Comprehension 


A8 


Finding A's 


50 


Picture Compl. 


-A9 


Word Transform. 


A8 


Surface Develop. 


50 


Digit Span 


51 


Word Begin/E^id 


5A 






Terman Concept M. 


52 


Mem for Deslgns-Q 57 


Vocabulary 


51 


Kec. Arith. Oper. 


A7 






Information 


50 


Identical Picture 


A5 


WAIS Verbal S. 


60 


Terman Concept M. 


A7 






Vocabulary 


A6 


Information 


A3 






Information 


A2 






Object Assembly 


62 


Comprehension 


A2 






Comprehension 


51 






WAIS Verbal S. 


A7 


Similarities 


A3 






Similarities 


57 






WAIS Perform. S. 


A6 


Digit Span 


A9 






Digit Span - F 


61 










Vocabulary 


56 






WAIS Verbal S. 


61 










Object Assembly 


A9 






WAIS Perform. S. 


52 










WAIS Verbal S. 


59 


















WAIS Perform. S. 


AS 



Note. Decimals and correlations less than .42 omitted. 
be distinguished from a more holistic imagery approach. The two 
variables were associated with different patterns within the 
task, even though neither was significantly related to overall 
performance. 
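
The contrast scores discussed above can be illustrated with a minimal sketch. It assumes three construction levels (no addition, one piece addition, two piece addition) and uses invented times for four hypothetical subjects. The weights (-1, 0, 1) and (1/2, -1, 1/2) are one conventional way of writing the linear and quadratic contrasts as they are defined in the text; the scaling used in the original analysis is not stated and may differ.

```python
import numpy as np

# Invented mean total times (seconds) for four hypothetical subjects at the
# three construction levels: no addition, one piece, two pieces.
times = np.array([
    [4.0, 6.0, 8.0],   # steady increase over the facet
    [3.0, 7.0, 8.0],   # most of the change between no addition and one piece
    [5.0, 5.5, 9.0],   # most of the change between one and two pieces
    [4.0, 4.0, 4.0],   # no change at all
])

# Linear contrast: two piece additions minus no addition.
linear_weights = np.array([-1.0, 0.0, 1.0])

# Quadratic contrast: average of (no addition, two pieces) minus one piece.
quadratic_weights = np.array([0.5, -1.0, 0.5])

# One contrast score per subject.
linear_scores = times @ linear_weights
quadratic_scores = times @ quadratic_weights

for subject, (lin, quad) in enumerate(zip(linear_scores, quadratic_scores), 1):
    print(f"subject {subject}: linear = {lin:+.2f}, quadratic = {quad:+.2f}")
```

Under these assumed weights, the third subject receives a positive quadratic score (little change from zero to one addition, a large increase from one to two), and the second subject a negative one, matching the verbal description of the contrast.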

Rotation Contrasts. There were only two significant 
correlations between the rotation contrasts and reference tests. 
The linear contrast correlated .57 with errors on the Cards test, 
and the quadratic contrast correlated .54 with WAIS Picture 
Completion. Later results suggest that errors on the Cards test 
are related to verbal strategies. Thus, those who showed the 
greatest time differences between no rotation and 180 degree 
rotation may have been using nonspatial strategies to rotate the 
stimuli. 

Stimulus Complexity. Both the linear and quadratic stimulus 
complexity contrasts correlated strongly with verbal ability. 
The linear contrast correlated significantly with seven reference 
tests, and only one of these was a spatial test (Paper Folding, 
r=.44). The highest correlation was with WAIS Verbal Scale Score 
(r=.60). 

The stimulus complexity quadratic contrast correlated 
significantly with fourteen reference tests. Ten of these were 
verbal or verbal reasoning tests, three were perceptual speed 
tests, and one was a spatial test (Paper Folding, r=.47). The 
highest correlations were .61 with WAIS Verbal Scale Score and 
Digit Span Forward. Thus, those who showed the largest increase 
in total time between low and medium complexity stimuli had low 
scores on these reference tests, while those who showed the 
greatest increase between medium and high complexity stimuli 
scored highest on these reference tests. 

Match. Although the match facet was not significant in the 
analysis of variance, it did have four large correlations with 
reference tests. Scores on this contrast represent the 
difference between time taken to accept a correct test stimulus 
and the time required to reject an incorrect stimulus. Thus, 
positive scores reflect longer acceptance times and negative 
scores longer rejection times. The contrast correlated 
negatively with Form Board and Picture Completion, and positively 
with Cards Wrong and Visual Number Span. Therefore, those who 
were faster accepting a correct probe scored higher on Form Board 
and Picture Completion, while those who were faster rejecting an 
incorrect probe made more errors on the Cards test and scored 
higher on the Visual Number Span test. Both of these latter 
variables were associated with verbal strategies throughout the 
analysis. Form Board and Picture Completion, on the other hand, 
are good examples of spatial tests that place a premium on 
comparing a constructed or remembered stimulus with a given 
stimulus. Therefore, as suggested in the comparison of SCORE and 
SCORE3, match times may provide a useful index of solution 
strategy. 



Type of Addition. Individual scores on this contrast 
correlated significantly with eight reference tests. The highest 
correlation was .62 with WAIS Object Assembly. Thus, those who 
obtained higher scores on this test took much longer on left 
additions than on right additions. Six correlations (ranging 
from r=.43 to .52) were with verbal tests, and one with another 
spatial test (Paper Folding, r=.48). Therefore, excepting the 
strong correlation with Object Assembly, time once again was 
related to verbal ability. 



Product Complexity. Contrast scores on this facet 
correlated with both spatial tests and verbal tests. Therefore, 
the effect is most clearly associated with general ability. High 
ability subjects took much longer on high complexity products 
than low complexity products. Further, this contrast was the 
best predictor of overall performance on this task. The 
correlation with total SCORE was .65, and with Total Correct Draw 
trials .73. 




Analyses of Correctness 

Means Analysis. 

Analyses of variance for SCORE and SCORE3 are shown in 
Tables 3.6 and 3.7. Estimated variance components and percent of 
variance associated with each facet in the SCORE3 analysis are 
reported elsewhere (see Lohman, 1979b). The data were first 
collapsed over the match and type of addition facets, except at 
level one of construction, where match was treated as 
replications. Each cell thus contained four observations. Cell 
means were then computed with two means per cell at level one of 
the construction facet and one mean at levels two and three. 
Replications within the level one construction cells provided an 
estimate, MS(e), used to test the subject-facet interactions. 

The two analyses produced quite different results. In 
particular, the linear contrast for stimulus complexity was 
highly significant in the SCORE3 analysis, but nonsignificant in 
the SCORE analysis. Product complexity was also highly 
significant in the SCORE3 analysis and only marginally 
significant in the SCORE analysis. On the other hand, the 
stimulus complexity by product complexity interaction was much 
stronger in the SCORE analysis. 

The major difference between these two variables is that a 
subject could obtain credit for an item under SCORE when he 
guessed, had little faith in his answer, or took an inordinately 
long time to accept or reject the test stimulus. SCORE3 
attempted to remedy these deficiencies. Correlations between 
total SCORE, total SCORE3, and reference tests previously 
examined (see Table 3.1) indicated that SCORE3 was a cleaner 
measure of spatial ability. The analyses of variance on the two 
variables confirm this. SCORE3 was a better measure of spatial 
ability than SCORE on two counts: its internal validity, as 
measured by congruence with the model, and its external validity, 
as measured by differential correlations with reference tests, 
were both higher. 

Plots for the construction, rotation, stimulus complexity, 
and product complexity SCORE3 main effects are shown in the 
Table 3.6 

SCORE Analysis of Variance 

Source                df     MS        F          Source       df     MS      F

Mean                  [?]    [?]       [?]*       N                   [?]
Construction          [?]                         NC
  Linear              1      .070[?]   [?]          Linear            .048[?] 2.78*[?]
  Quadratic           1      2.044[?]  40.87*[?]    Quadratic         .050[?] 2.90*[?]
Rotation              (2)                         NR
  Linear              1      .994      16.56*       Linear     29     .060    3.47*
  Quadratic           1      .081      2.32         Quadratic  29     .035    2.03*
Stimulus Complexity   (2)                         NS
  Linear                     .058      3.63         Linear     29     .016    .93
  Quadratic           1      .005      .14          Quadratic  29     [?]     2.08*
Product Complexity    1      .363      6.85       NP           29     [?]     3.07*
CR                           .166      [?]        NCR          116    .033    1.9[?]*
CS                    4      .240      6.73*      NCS          116    .0[?]4  2.07*
KP                    1      .063      1.30       NKP          29     .049    [?]
RS                    [?]    .048      [?].73     NRS          116    [?]     [?]*
RP                    2      .023      .72        NRP          58     [?]     [?]*
SP                    2      .359      9.32*      NSP          58     .038    2.23*
CRS                   8      .086      2.86*      NCRS         232    .030    1.73*
KRP                   2      .023      [?]        NKRP         58     .038    2.20*
KSP                   2      .620      14.95*     NKSP         58     .041    2.40*
RSP                   4      .254      7.63*      NRSP         116    .033
KRSP                  4      .204      5.78*      NKRSP        116    .035

Residual              270    .017

* = p less than .01 
[?] = entry illegible or doubtful in the source copy 
Table 3.7 

SCORE3 Analysis of Variance 

Source                df     MS        F          Source       df     MS      F

Mean                         740.133   1030.82*   N            29     .718
Construction          (2)                         NC           (58)
  Linear                     14.118    160.43*      Linear     29     .088    3.34*
  Quadratic                  4.129     74.44*       Quadratic  29     .057    2.17*
Rotation              (2)                         NR           (58)
  Linear                     2.827     27.19*       Linear     29     .104    3.95*
  Quadratic                  .039      .88          Quadratic  29     .044    1.67
Stimulus Complexity   (2)                         NS           (58)
  Linear                     1.387     33.82*       Linear     29     .041    1.56
  Quadratic                  .000      .01          Quadratic  29     .024    .91
Product Complexity           5.703     64.80*     NP           29     .088    3.34*
CR                           .270      6.35*      NCR          116    .042    1.62*
CS                           .284      7.33*      NCS          116    .039    1.47*
KP                           .029      .49        NKP          29     .059    2.24*
RS                           .046      1.23       NRS          116    .038    1.43
RP                    2      .198      3.97       NRP          58     .050    1.90*
SP                    2      .206      4.13       NSP          58     .050    1.90*
CRS                   8      .135      3.49       NCRS         232    .039    1.47*
KRP                   2      .100      2.83       NKRP         58     .036    1.35
KSP                   2      .789      15.00*     NKSP         58     .053    2.00*
RSP                   4      .209      4.51*      NRSP         116    .046    1.76*
KRSP                  4      .603      14.79*     NKRSP        116    .041    1.55*

Residual              270    .026

* = p less than .01 
second column of Figure 3.5 (below). Both rotation and stimulus 
complexity show linear declines in correctness. The important 
contrast for construction was between no addition (i.e., C=1) 
and addition (C=2,3). 

Contrast Correlations. 

Correlations between SCORE3 main effect contrast scores and 
reference tests are summarized in Table 3.8. 
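
As a rough illustration of how entries of this kind could be produced, the sketch below correlates one set of contrast scores with several reference tests and keeps only correlations of at least .42 in absolute value, printed without decimals as in the table notes. The function name, the data, and the test labels are hypothetical; only the .42 cutoff and the decimal-free display come from the tables themselves.

```python
import numpy as np

def table_entries(contrast_scores, reference_tests, cutoff=0.42):
    """Return {test name: 100 * r, rounded} for |r| at or above the cutoff.

    Pearson correlations are used here as an assumption; the cutoff and the
    omission of decimals follow the notes to the correlation tables.
    """
    entries = {}
    for name, scores in reference_tests.items():
        r = np.corrcoef(contrast_scores, scores)[0, 1]
        if abs(r) >= cutoff:
            entries[name] = int(round(r * 100))
    return entries

# Invented contrast scores for five hypothetical subjects.
contrast = np.array([1.0, 2.0, 3.0, 4.0, 5.0])

# Invented reference test scores (labels are placeholders, not real tests).
tests = {
    "Test A": np.array([2.0, 4.0, 6.0, 8.0, 10.0]),   # perfectly positive
    "Test B": np.array([5.0, 4.0, 3.0, 2.0, 1.0]),    # perfectly negative
    "Test C": np.array([1.0, -1.0, 0.0, -1.0, 1.0]),  # uncorrelated
}

result = table_entries(contrast, tests)
print(result)
```

With real data the retained entries would of course be far from the extreme values used in this toy example; the point is only the filtering and formatting step.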

Correlations between the N contrast and reference tests were 
previously discussed (see Table 3.1). To recapitulate, mean 
SCORE3 scores correlated significantly with ten spatial or Gf 
reference tests, and no verbal tests. Individual construction 
linear and quadratic contrast scores showed strong differential 
correlations with several reference spatial tests. The largest 
differentials were with Form Board and WAIS Object Assembly. The 
linear contrast scores correlated .53 and .56 with Form Board and 
Object Assembly, while the corresponding quadratic contrast 
correlations were -.52 and -.51. Thus, high scores on these 
tests are reflected in a linear decline in correctness over the 
construction facet. 

The only significant correlation for the rotation contrast 
scores was a correlation of .44 between the linear contrast and 
Figures. The correlation with Cards was similar but 
nonsignificant (r=.39). The rotation quadratic component 
correlations were not significant. 

While the stimulus complexity contrasts had no significant 
correlations with reference tests, the product complexity 
contrast correlated negatively with two WAIS verbal subtests and 
the Verbal Scale Score (r = -.43). Thus, large decrements in 
performance over the product complexity facet are more strongly 
associated with verbal ability than spatial ability. 

Preliminary Analysis of Confidence 
Although the confidence data were used primarily to help 
"clean up" the correctness index, they are of some interest in 
their own right. In general, the analysis of confidence mirrored 
Table 3.8 

Correlations between SCORE3 Main Effect Contrast Scores and Reference Tests (N=30) 



N 




C Linear 




C Quadratic 






ji 


Form Board 


53 


IQ 


-45 




00 


Copying 


42 


Form Board 


-52 


ruiui DUdlU 


HO 


Figures 


52 


Mem for Design8-Q-42 


OUriaCc UCVCiUp* 


OJ 


Mem for Designs-Q 58 


To.al Draw 


-56 






Total Draw 


50 


Identical Pic. 


-48 




Jv 


Identical Pic. 


44 


Object Assembly 


-51 




S9 


Object Assembly 


56 


WAIS Perform. S. 


-49 


nero tor L/csigns 


00 


WAIS Perform. S. 


51 






Total Draw 


69 










Vis Num Span 


43 










Letter Series 


52 










Picture Arrange • 


48 










MAIS Perforin, 


51 











R Linear 



R Quadratic 




S Linear 



S Quadratic 





Digit Span-F -44 


Comprehension 


-45 






Similarities 


-45 






WAIS Verbal S. 


-43 



Note. Decimals and correlations less than .42 omitted. 



the analysis of a composite index of difficulty (time minus 
correctness) that is reported below. 

The analysis of variance on these scores is reported in 
Lohman (1979b). Confidence decreased significantly over all 
facets except Match. For rotation and stimulus complexity, the 
decline was linear (the quadratic contrasts were not 
significant). For construction, the decline was greater between 
no addition and one addition than between one and two piece 
additions. 

There were a number of significant interactions. However, 
all were congruent with other analyses, particularly ZDIFF (see 
below) and total time. Therefore, they will not be discussed 
here. The important finding here was that immediately obtained 
self report sureness ratings closely paralleled the more 
"objective" latency and correctness measures. Thus, self reports 
may have considerably more validity than is commonly assumed 
(e.g., Nisbett and Wilson, 1977), especially when obtained as in 
this experiment. 



Comparison of SCORE3 and Total Time Analyses. 
The separate analyses on total time and SCORE3 clearly reveal 
that time and error rate are not interchangeable aspects of 
performance. 

Since the data were collapsed over match and type of addition 
for the correctness analyses, the analyses of variance on total 
time and SCORE3 are not totally comparable. There were, however, 
two important differences between them. 

1. Error rate was the same at levels two and three of 
the construction facet, while total time increased 
linearly over the three levels of construction. 

2. Interactions were fewer and smaller in the SCORE3 
analysis than in the total time analysis. 

The most striking differences between the two analyses, 
however, were in the correlations between individual contrast 
scores and reference tests. These differences are summarized 
below for each contrast. 
N Contrasts. 

Mean SCORE3 correlated significantly with ten spatial or Gf 
tests. Mean time, on the other hand, correlated significantly 
with one verbal reasoning test. Correlations with several other 
verbal tests approached significance, while correlations with 
spatial tests centered near zero. 



Construction. 

For SCORE3, a linear decline was associated with high scores 
and a quadratic decline with low scores on Form Board and Object 
Assembly. For total time, a linear increase was associated with 
verbal ability. The quadratic contrast scores for total time 
correlated positively with Marks Vividness of Visual Imagery 
Questionnaire and negatively with Hidden Figures. 

Rotation. 

The linear decline in SCORE3 over the rotation facet was 
correlated significantly with only one reference test: Figures. 
The correlation with the Cards test was similar but not 
significant. For total time, the magnitude of the linear 
increase was correlated with number of errors on the Cards test. 

Stimulus Complexity. 

Neither the linear nor quadratic SCORE3 stimulus complexity 
contrasts correlated with any of the reference variables. Both 
total time contrasts, on the other hand, correlated strongly with 
verbal tests. 

Product Complexity. 

For SCORE3, large decrements in performance over this facet 
were associated with verbal ability. For total time, large 
increments were associated with general ability. 



Match and Type of Addition. 

Since data were collapsed over these facets in the SCORE3 
analysis, only total time contrast scores were obtained. For the 
match facet, faster responses to positive test stimuli were 
associated with high scores on Form Board and Picture Completion. 
Faster responses to negative test stimuli were associated with 
more errors on the Cards test and higher scores on Visual Number 
Span. 

Taking much longer on left additions than on right additions 
correlated highly with Object Assembly and moderately with 
several verbal tests. 



Summary of the Separate Analyses. 

Correlations between the two N contrasts and the reference 
tests address the construct validity of the test in global terms. 
They both answer the question, "What, in general, is measured by 
this task?" For SCORE3, the answer is clearly "spatial ability." 
For total time, it is a more tentative "verbal ability, if 
anything." 

Given these global assessments, the contrast correlations 
within each facet look at the relationships between the pattern 
of scores over that facet and the reference tests. 

In general, the SCORE3 contrast correlations supported the 
construct validity of the task, while the total time correlations 
revealed more about the strategies or processes subjects used to 
solve particular item types. These correlations have already 
been discussed in some detail for the stimulus complexity, 
product complexity, match and type of addition facets. 
Construction and rotation, however, have been examined only 
briefly. They are the two "process" facets and thus merit closer 
scrutiny. 

The differential correlations between the construction 
contrasts, and Form Board and Object Assembly support the 
hypothesis that mental construction is an identifiable spatial 
skill. Total time contrasts for this facet reveal that those who 
took much longer to solve the two addition items than would be 
predicted by their "basal" time on no construction items were 
probably reverting to verbal strategies. Even more interesting 
was the finding that those who reported strong visual imagery 
showed smaller increases in time between two and three piece 
additions than between no addition and one piece addition. On 
the other hand, those whose times increased most between two 
and three piece additions may have been using a more analytic 
spatial strategy. 

For the rotation facet, the correlations between the linear 
contrast scores and the two rotation reference tests, Cards and 
Figures, support the construct validity of the facet. Large 
increases in time over the facet were associated with number of 
errors on the Cards test. Examination of component times and 
their correlations suggests that those who made many errors on 
the Cards test may have been using a verbal strategy to solve 
some or all of the items. Thus, once again time reveals more 
about process while correctness contrasts address the construct 
validity of the facet. 

IV. Combined Analyses on Total Time and Correctness 
The Advantages of Combined Analyses. 

While separate univariate analyses are useful in providing 
an overview of the major results, they have a number of 
disadvantages. First, it is difficult to coordinate the results 
of several analyses on correlated measures. More important, 
however, is the fact that such analyses may be misleading. This 
is more a product of the experimental dependence of time and 
correctness than of their statistical dependence. 

The interaction between rotation and product complexity 
provides an example of the ambiguities that arise when time and 
correctness are analyzed separately. The interaction was highly 
significant for total time, marginally significant for SCORE3 
(correctness adjusted for confidence and match time), and 
nonsignificant for SCORE (correctness). Plots of the cell means 
for each of the three dependent variables are shown in the top 
row of Figure 3.3. The plot for total time reveals an almost 
perfect linear relationship between time and rotation for low 
complexity constructed stimuli. However, it appears that, for 
constructed stimuli of high complexity, 180 degree rotations may 
be performed as quickly as 90 degree rotations. 

While the plots for SCORE do not shed any light on the 
problem, the plot for SCORE3 is suggestive. The latter plot 
shows a linear decrease in correctness for high product 
complexity items. Note also that probability correct (by SCORE3) 
for high product complexity is almost at .5 for 90 degree 
rotations and then below it for 180 degree rotations. This 
suggests that a 90 degree rotation of a complex constructed 
stimulus was about all the average subject could manage. 180 
degree rotations of such stimuli were just too difficult, and so 
subjects gave up and resorted to guessing. 



Sum and Difference Score Analyses 
While such coordinate examination of error and latency data 
is useful, there is a better way to examine their joint and 
independent effects. Time and correctness (by SCORE3) were first 
independently standardized to zero mean and unit variance. Two 
scores were then derived by taking the sum and the difference of 
these standard scores. ZSUM was the sum of the standard scores 
for total time and correctness, and ZDIFF was time minus 
correctness. ZDIFF contrasts fast-accurate (low negative) and 
slow-inaccurate (high positive) performance, while ZSUM contrasts 
fast-inaccurate (low negative) and slow-accurate (high positive) 
performance. Variations in facet difficulty or complexity should 
be reflected in ZDIFF, i.e., fast, accurate responses to easy 
items and slow, inaccurate responses to difficult items. ZSUM, 
on the other hand, reflects variations in speed-accuracy 
tradeoff. 
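
The derivation of the two scores can be sketched as follows. This is a minimal illustration under stated assumptions, not the original computation: the data are invented, the function names are hypothetical, and the standard deviation is taken over all observations with a divisor of n, since the text does not say which divisor was used.

```python
import numpy as np

def standardize(x):
    """Rescale scores to zero mean and unit variance (divisor n, an assumption)."""
    x = np.asarray(x, dtype=float)
    return (x - x.mean()) / x.std()

def zsum_zdiff(total_time, correctness):
    """Return (ZSUM, ZDIFF): sum and difference of the two standard scores."""
    z_time = standardize(total_time)
    z_correct = standardize(correctness)
    return z_time + z_correct, z_time - z_correct

# Invented item-level data: total time (seconds) and correctness (0 to 1).
total_time = np.array([2.0, 3.0, 5.0, 8.0, 9.0, 4.0])
correctness = np.array([0.90, 0.80, 0.70, 0.40, 0.60, 0.95])

zsum, zdiff = zsum_zdiff(total_time, correctness)

# A fast, accurate item gets a negative ZDIFF; a slow, inaccurate one a
# positive ZDIFF. The two derived scores are uncorrelated by construction.
print("ZDIFF:", np.round(zdiff, 2))
print("ZSUM: ", np.round(zsum, 2))
print("r(ZSUM, ZDIFF) =", round(float(np.corrcoef(zsum, zdiff)[0, 1]), 6))
```

In this toy data the first item (fast and accurate) falls at the negative end of ZDIFF and the fourth (slow and inaccurate) at the positive end, which is the difficulty ordering the text describes.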

Plots of ZDIFF and ZSUM for the Rotation by Product 
Complexity interaction are shown in the bottom row of Figure 3.3. 
Although the quadratic component of the interaction is still 
significant in this picture, the slopes of the lines at the two 
levels of product complexity are more nearly parallel. Further, 
the ZSUM plot reveals a large shift in speed-accuracy tradeoff 
for high product complexity items. Subjects may have given up 
too soon or may have been unable to perform these difficult items. 
Later results favor the latter interpretation. 



Since these two derived scores are statistically 
independent, each may be analyzed separately and the results then 
compared unambiguously. 
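
A short derivation shows why the sum and difference scores do not overlap linearly. The symbols $z_T$ and $z_C$ for standardized total time and standardized correctness are notation introduced here for illustration. Strictly, the result below establishes zero covariance; full statistical independence would require further distributional assumptions (for example, joint normality) that the text does not state.

```latex
\begin{align*}
\mathrm{ZSUM} &= z_T + z_C, \qquad \mathrm{ZDIFF} = z_T - z_C, \\
\operatorname{Cov}(\mathrm{ZSUM}, \mathrm{ZDIFF})
  &= \operatorname{Var}(z_T) - \operatorname{Cov}(z_T, z_C)
     + \operatorname{Cov}(z_C, z_T) - \operatorname{Var}(z_C) \\
  &= 1 - 1 = 0 .
\end{align*}
```

The cross terms cancel regardless of how strongly time and correctness are correlated, so the result depends only on both variables having been standardized to the same variance.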

Analyses of variance on ZSUM and ZDIFF are presented in 
Tables 3.9 and 3.10. Estimated variance components are reported 
elsewhere (see Lohman, 1979b). In the ZDIFF analysis, design 
facets accounted for 52.3 percent of the variation in scores, and 
subjects and their interactions with design facets accounted for 
44.0 percent. Thus, item difficulty was a product of both design 
facets and subject interactions. For ZSUM, on the other hand, 
the design facets accounted for only 9.4 percent of the variation 
in the data. Subjects and their interactions with design vectors 
accounted for 84.9 percent. Thus, the speed-accuracy tradeoff 
was only slightly influenced by design facets; rather, it was 
largely a function of subjects and how they responded to 
particular item types. These relationships are shown graphically 
in Figure 3.4. 

Interpretation of the Sum and Difference Score Analyses 
N Contrast. 

This contrast reflects variability in the subject means about 
the grand mean. For ZDIFF, the correlations with spatial 
reference tests were mostly negative but nonsignificant. The 
only significant correlation was -.50 with WAIS Picture 
Arrangement. The correlations with verbal tests were zero or 
slightly positive. Thus, except for the WAIS Picture Arrangement 
subtest, fast-accurate performance on this task was only weakly 
related to fast-accurate performance on other spatial tasks. 

The N contrast for ZSUM, on the other hand, had a number of 
large positive correlations with reference tests: Total Draw 
(.66), Paper Folding (.55), Letter Series (.55), Visual Number 
Span (.54), Terman Concept Mastery (.52), Memory for Design 
Quality (.51), WAIS Information (.47), and Verbal Scale Score 
(.44). Although verbal tests are represented on this list, the 
higher correlations are associated with complex spatial and 
reasoning tests. The pattern is congruent with Cattell's general 



Table 3.9 

ZSUM Analysis of Variance 

Source                df     MS        F          Source       df     MS      F

Mean                  1      .00       -          N            29     32.19
Construction          (2)                         NC           (58)
  Linear                     39.95     8.36*        Linear     29     4.78    9.53*
  Quadratic                  22.70     18.66*       Quadratic  29     .85     1.70
Rotation              (2)                         NR           (58)
  Linear                     15.90     7.17         Linear     29     2.22    4.43*
  Quadratic           1      11.35     18.02*       Quadratic  29     .63     1.26
Stimulus Complexity   (2)                         NS           (58)
  Linear                     15.9[?]   8.94*        Linear     29     1.78    [?]
  Quadratic                  .74       1.88         Quadratic  29     .39
Product Complexity           2.09      .95        NP           29     2.21    4.4[?]
CR                           7.69      11.10*     NCR          116    .69     1.38
CS                           2.13      4.25*      NCS          116    .64     1.28
KP                           16.43     15.31*     NKP          29     1.07    2.14*
RS                           .97       1.84       NRS          116    .52     1.05
RP                    2      5.24      7.45*      NRP          58     .70     1.40
SP                    2      3.54                 NSP          [?]            [?]*
CRS                   8      1.52      2.73*      NCRS         232    .56     1.11
KRP                   2      .14       .31        NKRP         58     .47     .93
KSP                   2      1.58      3.19       NKSP         58     .50     .99
RSP                   4      5.06      6.09*      NRSP         116    .83     1.56*
KRSP                  4      1.32      1.63       NKRSP        116    .8[?]   1.61*

Residual              270    .50

* = p less than .01 
[?] = entry illegible or doubtful in the source copy 
Table 3.10 

ZDIFF Analysis of Variance 

Source                df     MS        F          Source       df     MS      F

Mean                  1      .00       -          N            29     22.77   -
Construction          (2)                         NC           (58)
  Linear              1      1099.89   305.53*      Linear     29     3.60    6.59*
  Quadratic           1      94.73     119.76*      Quadratic  29     .79     1.45
Rotation              (2)                         NR           (58)
  Linear              1      255.79    122.80*      Linear     29     2.08    [?]*
  Quadratic           1      3.82      4.43         Quadratic  29     .86     1.58
Stimulus Complexity   (2)                         NS           (58)
  Linear              1      153.91    89.01*       Linear     29     1.73    3.17*
  Quadratic           1      .92       1.79         Quadratic  29     .51     .94
Product Complexity    1      243.85    93.21*     NP           29     2.62    4.79*
CR                    4      1.24      2.11       NCR          116    .59     1.07
CS                           11.16     18.19*     NCS          116    .61     1.12
KP                    1      8.34      6.71       NKP          29     1.21    2.22*
RS                    4      .97       1.65       NRS          116    .58     1.07
RP                    2      [?].76    6.88*      NRP          58     .[?]4   1.53
SP                    2      8.06      8.96*      NSP          58     .90     1.54*
CRS                   8      2.73      3.95*      NCRS         232    .69     1.27
KRP                   [?]    3.55      5.54*      NKRP         58     .64     1.17
KSP                   2      25.64     31.93*     NKSP         58     .80     1.47
RSP                   4      2.36      3.09       NRSP         116    .76     1.40
KRSP                  4      23.79     27.27*     NKRSP        116    .87     1.60*

Residual              270    .55

* = p less than .01 
[?] = entry illegible or doubtful in the source copy 
fluid ability factor (Gf). Thus, the general tendency to opt for 
accuracy over speed was associated with Gf or general ability 
(G). 

[Figure with panels labeled ZDIFF and ZSUM] 



Construction. 

Construction was the most powerful design facet. Both the 
linear and quadratic effects were highly significant for ZDIFF. 
As can be seen in Figure 3.5, the quadratic effect reflects a 
larger increase in difficulty between no construction (C=1) and 
one piece construction (C=2), than between one and two piece 
additions. Subject interactions with the linear term were also 
highly significant, reflecting large individual differences in 
slopes. However, there were no significant correlations between 
the construction linear contrast and reference constructs. 
Individual scores for the quadratic construction contrast, 
however, correlated .51, .43, .54, and .45 with Form Board, total 
number of correct draw trials, WAIS Object Assembly, and WAIS 
Performance Scale Score, respectively. Thus those subjects who 
made relatively more errors and had relatively longer latencies 
on level 3 construction items than on level 2 items did better on 
the draw trials and on these reference tests. In other words, 
subjects who "topped out" at level 2 construction items tended to 
have lower scores on these three variables. 

A plot of the ZSUM means over the construction facet is also 
shown in Figure 3.5. Since subject interactions with both the 
linear and quadratic components were large, the plot is not 
representative of individual data. The most striking 
relationships here were the high positive correlations between 
the linear contrast scores and verbal and memory span reference 
constructs, and high negative correlations between quadratic 
contrast scores and spatial reference constructs. Further, the 
two contrasts had opposite correlations with Total Draw, .60 for 
the linear contrast and -.63 for the quadratic contrast. The 
quadratic contrast correlations indicate that those who showed a 
much larger shift in speed-accuracy tradeoff between one and two 
piece additions than between zero and one piece additions 
Figure 3.5. Plots of main effect cell means for total time, SCORE3, 
ZDIFF, and ZSUM. 
performed poorly on the reference spatial tests and the draw 
trials. On the other hand, large shifts between no construction 
and two piece additions were associated with verbal and verbal 
memory span tests. 

Rotation. 

For ZDIFF, the linear contrast was highly significant, and 
the quadratic contrast only weakly significant. Figure 3.5 
reveals the nearly perfect linear relationship between ZDIFF and 
rotation. Similarly, subject interactions with the linear 
component were strong, while the subjects by rotation quadratic 
term was only marginally significant. There were no significant 
correlations between either contrast and the reference tests. 

Again the ZSUM contrasts had several noteworthy 
correlations. The linear and quadratic contrasts correlated .45 
and -.22 with number of errors on the Cards tests. Corresponding 
correlations with the Total Draw score were .44 and -.02. Thus, 
those who shifted to slower, more accurate responding for 180 
degree rotations did better when they had to draw the answer, but 
made more errors on the multiple choice Cards test. 

Stimulus Complexity.

For ZDIFF, only the linear contrast and subjects by linear
contrast were significant. The straight line plot is shown in
Figure 3.5. It is noteworthy that the analyses of total time
suggested significant nonlinear trends for stimulus complexity
and rotation. The plots became almost perfectly linear only when
latency and correctness were analyzed jointly. The linear
contrast had only two significant correlations: .53 with WAIS
Verbal Scale Score, and .67 with WAIS Digit Span Backwards. The
quadratic component correlated .48 with Visual Number Span and
.55 with WAIS Digit Span Forward. Together, these correlations
suggest that those who showed the strongest effects for stimulus
complexity may have been using verbal strategies to encode and
remember the stimuli.

The plot for ZSUM on stimulus complexity is shown in Figure
3.5. Again, only the linear and subjects by linear contrasts
were significant. As with ZDIFF, correlations with reference
tests again suggest a connection with verbal ability. Both
linear and quadratic contrast scores correlated .45 with WAIS IQ.
Only the linear contrast had other significant correlations.
Four were with verbal tests (Word Beginnings and Endings, .50;
WAIS Comprehension, .43; Vocabulary, .47; and Verbal Scale Score,
.48) while two were with spatial tests (Cards Wrong, .50; and
WAIS Object Assembly, .44). However, Cards Wrong is probably a
better index of verbal strategies than spatial ability. Thus,
large shifts toward slower, more accurate performance over the
rotation facet were most strongly related to verbal ability and
verbal strategies.



Product Complexity.

As shown in Table 3.10, product complexity and subjects by
product complexity were highly significant in the ZDIFF analysis.
Both errors and latency were greater for high complexity
constructed stimuli.

Contrast scores for product complexity were the best
predictor of performance on the reference tests. High
correlations with both verbal and spatial tests implicate general
ability. In particular, the performance of high ability students
was influenced by the complexity of the to-be-constructed
stimulus. This is most easily explained by assuming that high
ability students were able to combine the stimuli mentally as
directed (using verbal strategies, spatial strategies, or both),
while low ability students were not able to generate a clear
image of the constructed stimulus.

This result is even more interesting in the light of the
ZSUM analysis. While the subjects by product complexity
interaction was significant for ZSUM, the main effect was not.
However, none of the contrast correlations were significant.
Hence, the product complexity effect is not a reflection of
systematic changes in speed-accuracy tradeoff.

Defense of Sum and Difference Scores

The propriety of any index depends on the question that is
asked. Thus, questions about the relationships between latency
or correctness and the design facets or reference tests are best
addressed by separate analyses on the two measures.

But, for example, reading and mathematics achievement test
scores may be usefully combined to represent two new variables,
namely, general scholastic achievement, and differential
achievement in reading and mathematics. Verbal and spatial
scores also may be summed to represent general ability and
subtracted to provide a special ability contrast. Similarly,
latency and correctness can be combined to form new indices. The
major difference between the separate scores and the two
combination indices lies in their interpretation. Further,
unlike the other examples cited above, latency and correctness
are not only statistically, but also experimentally dependent.
This also affects their interpretation.

The difference score (time minus correctness) is best
interpreted as a composite index of difficulty, while the sum
represents both the speed-accuracy tradeoff and the independent
contributions of the two variables.
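
The construction of the two indices can be sketched in a few lines
of Python. This is only an illustration: it assumes that the
normalization is a conversion to z scores, the function names are
hypothetical, and the data are invented rather than taken from the
study.

```python
from statistics import mean, pstdev


def zscores(values):
    """Standardize a list of numbers to mean 0 and standard deviation 1."""
    m = mean(values)
    s = pstdev(values)
    return [(v - m) / s for v in values]


def sum_and_difference(latency, correctness):
    """Return (zsum, zdiff) from raw latency and correctness scores.

    Assumption: each measure is standardized separately before the
    two are combined.
    """
    z_time = zscores(latency)
    z_corr = zscores(correctness)
    zsum = [t + c for t, c in zip(z_time, z_corr)]
    zdiff = [t - c for t, c in zip(z_time, z_corr)]  # time minus correctness
    return zsum, zdiff


# Hypothetical data for four items of increasing difficulty.
latency = [4.0, 6.0, 8.0, 10.0]      # seconds
correctness = [0.9, 0.8, 0.6, 0.5]   # proportion correct
zsum, zdiff = sum_and_difference(latency, correctness)
```

In this invented example latency rises while correctness falls, so
the difference score climbs steadily across the four items while the
sum score stays close to zero.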

If latency and correctness have opposite patterns over a
facet (e.g., latency increases linearly while correctness
decreases linearly), then the means analysis on the difference
score will show a larger effect for the facet than either of the
separate analyses. Similarly, the sum score will be
nonsignificant over the facet, reflecting the fact that the
patterns are the same, but opposite, for the two variables.

For the correlational analysis, if both latency and
correctness correlate positively with a given test, then the sum
score correlation will be higher, and the difference score
correlation zero. If latency correlates with verbal tests and
correctness with spatial tests, then the sum will correlate with
both and the difference with neither cluster of tests. In such
cases, separate analyses on latency and correctness provide
greater insight.

However, there are times when the sum and difference scores
provide unique information. Several puzzling interactions
appearing in the separate analyses were eliminated or attenuated
in the combined analyses (e.g., see Figure 3.3). Similarly, both
rotation and stimulus complexity showed nonlinear trends in the
total time analysis, but were linear in the ZDIFF analysis.
Thus, "difficulty" increased linearly over these facets, while
time and correctness had nonlinear, but opposite, quadratic
effects.
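
The correlational argument made earlier in this section (when
latency and correctness both correlate positively and equally with
a test, the sum correlates more highly and the difference not at
all) can be checked on a small constructed example. The sketch
below is illustrative only: the four "subjects" and their scores
are hypothetical, and the helper names are not from the original
analyses.

```python
from statistics import mean, pstdev


def zscores(values):
    """Standardize a list of numbers to mean 0 and standard deviation 1."""
    m, s = mean(values), pstdev(values)
    return [(v - m) / s for v in values]


def pearson_r(x, y):
    """Pearson correlation as the mean product of z scores."""
    zx, zy = zscores(x), zscores(y)
    return sum(a * b for a, b in zip(zx, zy)) / len(x)


# Hypothetical scores for four subjects, built so that latency and
# correctness correlate equally (about .71) with the reference test.
test = [-1, -1, 1, 1]
latency = [-2, 0, 0, 2]
correctness = [-2, 0, 2, 0]

z_time, z_corr = zscores(latency), zscores(correctness)
zsum = [t + c for t, c in zip(z_time, z_corr)]
zdiff = [t - c for t, c in zip(z_time, z_corr)]

r_time = pearson_r(latency, test)
r_corr = pearson_r(correctness, test)
r_sum = pearson_r(zsum, test)    # larger than either separate correlation
r_diff = pearson_r(zdiff, test)  # zero in this constructed case
```

The exact zero for the difference score depends on the two separate
correlations being equal; with unequal correlations the difference
score correlation would be small rather than zero.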

Similarly, correlations between the sum and difference score
contrasts were often quite unlike the contrast correlations in
the separate analyses. For example, in the analysis on
construction component time and construction SCORE3 (see below),
the type of addition contrast for component time correlated
significantly with seven verbal tests (r = .41 to .56) and Object
Assembly (r = .45). The SCORE3 type of addition contrast
correlated with Marks Vividness of Visual Imagery Questionnaire (r
= .49) and negatively with Visual Number Span (r = -.45).
However, the sum score correlated with Form Board (r = .43) and
Object Assembly (r = .46), while the difference score correlated
negatively with Marks Questionnaire (r = -.56) and positively with
Visual Number Span. Thus, the verbal tests dropped from the
picture and the two tests most similar to the experimental
construction task (Form Board and Object Assembly) became the
only two tests correlated with the sum score. The sum score
correlations occurred because Form Board and Object Assembly were
positively correlated with both the time contrast (r = .32 and
.45) and the SCORE3 contrast (r = .32 and .28), although the
three smaller correlations were not significant. On the other
hand, the SCORE3 type of addition contrast had zero or slightly
negative correlations with the verbal tests. Thus, when SCORE3
and time were added together, the two tests positively correlated
with both the time and correctness contrasts emerged with larger
correlations, while the verbal test correlations became
nonsignificant.

Summary of the Sum and Difference Score Analyses
The analyses of variance on ZSUM and ZDIFF differed most
significantly in the way score variation was accounted for by
facets and subject interactions with facets. In particular, the
ZDIFF analysis revealed that item difficulty (on a scale of
fast-accurate to slow-inaccurate) was a function of both facets
and subject interactions with facets. The speed-accuracy tradeoff
represented by ZSUM, on the other hand, was only slightly
influenced by design facets. Rather, variation in these scores
was largely a function of subject interactions with design
facets.

Most of the relationships between ZSUM and ZDIFF contrast
scores and reference tests are congruent with the previous
analyses on total time and SCORE3. There are important
differences, however, and these are noted below.

N Contrast.

For ZDIFF, generally fast and accurate performance on this
task was weakly related to similar performance on other spatial
tasks, and strongly indicative of high scores on the WAIS Picture
Arrangement subtest. There was no relationship, or, if anything,
an inverse relationship between ZDIFF and performance on Verbal
tests.

For ZSUM, the general tendency to opt for accuracy over speed
was associated with Gf.

Construction.

As in the SCORE3 analysis, nonlinear increases in ZDIFF were
positively related to Form Board and Object Assembly. Those who
did not take more time and make more errors on two piece
additions than on one piece additions performed poorly on two
reference construction tests.

One explanation for this phenomenon would be that those who
performed poorly on these construction tasks did not fully
appreciate the increased difficulty of the two piece additions,
and thus did not really try harder on the more difficult
additions. However, the ZSUM contrast correlations argue against
this hypothesis. Large shifts in speed-accuracy between levels
two and three on the construction facet were also related to poor
performance on the reference tests. Therefore, a better
explanation of the ZDIFF quadratic correlations is that those who
did not show an increase in error rate and time when shifting
from one to two piece additions had reached their limit on the
one piece additions.

Finally, the ZSUM linear correlations related large shifts in
speed-accuracy tradeoff between no construction and two
additions to verbal ability.

Rotation.

Only the ZSUM linear contrast had significant correlations
for the rotation facet. Large shifts toward slower, more
accurate performance were related to number of errors on the
Cards test and Visual Number Span.

Stimulus Complexity.

The ZSUM and ZDIFF analyses indicated that those who showed
the largest effects for this facet may have been using verbal
strategies to encode and remember the stimuli. The major
difference between these correlations and those for the same
contrast in the total time analysis was the focus on verbal short
term memory tests in the ZDIFF analysis.

Product Complexity.

The most significant contribution of the ZSUM and ZDIFF analyses
here was the finding that the product complexity effect was not a
function of systematic changes in speed-accuracy tradeoff.

V. Component Time Analyses

At least two component times (encode, match) and as many as
four component times (encode, add, rotate, match) were collected
for each item. Encoding and match times were available for all
216 items, construction and rotation times for 144 items. Since
construction always preceded rotation, rotation component times
for the 96 items that also required construction were not
experimentally independent of the construction operations.
Rotation time on these items depended not only on the success of
the rotation, but also on the success of the preceding addition.
Similarly, only 24 match times were not preceded by construction
or rotation. Therefore, the analyses of the rotation and match
component times were necessarily more limited than those on
encoding and construction.



Encoding

Means Analysis.

Stimulus complexity and practice (or block) were the only
two factors represented in the design likely to affect encoding.
The analysis of variance of encoding time by these factors is
reported in Table 3.11.

Only the linear components of both stimulus complexity and
block were significant. Cell means were 4.02, 5.41, and 6.92 sec
for low, medium, and high complexity stimuli, respectively. Cell
means for the four blocks were 6.71, 5.33, 5.25, and 4.49 sec.
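
The linear and quadratic trend components can be illustrated on the
cell means just reported. The sketch below assumes the usual
orthogonal polynomial weights for equally spaced levels; the text
does not list the weights actually used, and the function name is
hypothetical.

```python
def contrast(means, weights):
    """Weighted sum of cell means; the weights must sum to zero."""
    assert sum(weights) == 0
    return sum(m * w for m, w in zip(means, weights))


# Encoding time cell means (sec) as reported in the text.
complexity_means = [4.02, 5.41, 6.92]     # low, medium, high
block_means = [6.71, 5.33, 5.25, 4.49]    # blocks 1 to 4

# Three-level facet: linear and quadratic weights.
complexity_linear = contrast(complexity_means, [-1, 0, 1])
complexity_quadratic = contrast(complexity_means, [1, -2, 1])

# Four-level facet: linear weights only.
block_linear = contrast(block_means, [-3, -1, 1, 3])
```

With these weights the complexity means give a large linear value
(about 2.90 sec) and a quadratic value near zero (about 0.12 sec),
which agrees with the statement that only the linear component was
significant. The block linear value is negative because encoding
time fell with practice.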

There were significant individual differences in both linear
contrasts and in the nonlinear block contrast. Subjects also
interacted significantly with the linear by linear contrast.
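
The F ratios in Table 3.11 appear consistent with testing each facet
contrast against its interaction with subjects, and each subject
interaction against the residual mean square of .61. That reading is
inferred from the tabled values and is not stated in the text; the
short Python check below simply reproduces three tabled ratios under
that assumption.

```python
def f_ratio(ms_effect, ms_error):
    """F as the ratio of an effect mean square to its error mean square."""
    return ms_effect / ms_error


# Mean squares taken from Table 3.11.
ms_block_linear = 203.11
ms_subjects_by_block_linear = 11.08
ms_lin_by_lin = 2.81
ms_subjects_by_lin_by_lin = 1.52
ms_residual = 0.61  # NS(quadratic)B(nonlinear) interaction

# Facet contrasts tested against their subject interactions (assumed).
f_block_linear = f_ratio(ms_block_linear, ms_subjects_by_block_linear)
f_lin_by_lin = f_ratio(ms_lin_by_lin, ms_subjects_by_lin_by_lin)

# Subject interaction tested against the residual (assumed).
f_subjects_by_block_linear = f_ratio(ms_subjects_by_block_linear, ms_residual)
```

Rounded to two places these give 18.33, 1.85, and 18.16, matching
the corresponding entries in the table.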



Contrast Correlations

N Contrast. The first important finding is a negative one:
there were no relationships between average encoding time and any
of the reference tests or total scores. The correlation between
mean encoding time and IQ was -.03. The only significant
correlation (r = .48) was with number of marks made on the Form
Board test, a strategy index. Those who made more marks on the
stimuli in the Form Board test spent more time studying the
stimuli in this experiment.

Table 3.11

Analysis of Variance for Encoding Component Time

Source                 df    MS          F         Source        df     MS       F(a)

Mean                    1    10683.44    176.08    N                    60.67    -
Stimulus Complexity    (2)                         NS
  Linear                1    503.07      69.90*      Linear                      11.80*
  Quadratic             1    .32         .48         Quadratic   29     .68      1.11
Block                  (3)                         NB            (87)
  Linear                1    203.11      18.33*      Linear      29     11.08    18.16*
  Nonlinear             2    26.46       2.05        Nonlinear   58     6.45     10.57*
SB                     (6)                         NSB           (174)
  Lin X Lin             1    2.81        1.85        Lin X Lin   29     1.52     2.50*
  Qd X Nln              5    .82         1.34        Qd X Nln    145    .61

(a) MS(e) estimated by the NS(quadratic)B(nonlinear) interaction.
*p less than .01

Stimulus Complexity. The stimulus complexity linear
contrast had significant correlations with WAIS Digit Span
(r = .47), Vocabulary (r = .44), Verbal Scale Score (.49), and thus
IQ (.43). The correlation with WAIS Performance Scale Score was
a nonsignificant .27. Only one spatial test, Paper Folding,
correlated significantly with the contrast (r = .47). There was
also an interesting differential set of correlations with the two
correctness indices: r = .43 with total SCORE# but only .14 with
total SCORE3.

The quadratic component had even higher correlations with
verbal tests. Correlations with all WAIS verbal subtests except
Arithmetic were significant, and the correlation with Verbal
Scale Score was .58. Other significant correlations were total
correct Draw trials (.52), Visual Number Span (.44), Number
Comparison (.43), and Camouflaged Words (.54).

These quadratic component correlations mean that those who
showed greater increases in encoding time between medium and high
complexity than between low and medium complexity tended to score
higher on verbal tests than those whose times increased linearly
or by the reverse pattern.
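
The three patterns just described correspond to the sign of a
subject's quadratic contrast score. The following sketch uses
invented encoding times for three hypothetical subjects and assumes
the standard quadratic weights (1, -2, 1); it is meant only to make
the interpretation concrete.

```python
def quadratic_contrast(low, medium, high):
    """Quadratic contrast score: (high - medium) minus (medium - low)."""
    return low - 2 * medium + high


# Hypothetical encoding times (sec) at low, medium, high complexity.
subjects = {
    "accelerating": (4, 5, 8),  # larger jump from medium to high
    "linear": (4, 6, 8),        # equal jumps
    "reverse": (4, 7, 8),       # larger jump from low to medium
}

scores = {name: quadratic_contrast(*times) for name, times in subjects.items()}
```

A positive score marks the accelerating pattern that went with
higher verbal test scores, a score of zero marks a strictly linear
increase, and a negative score marks the reverse pattern.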

Strategy reports provide a good explanation of this
phenomenon. Some subjects reported that they tried to remember
complex stimuli by seeing them as composites of simpler forms
such as triangles and squares. Both additive and subtractive
versions of this strategy were reported, i.e., "a triangle on
top of a square," and "a square with a triangle cut out of the
top."

Subjects using this strategy would be much faster on low and
medium complexity stimuli (3-4 points) since most of these
stimuli can be immediately labelled, even when irregular (e.g., "a
triangle leaning to the right"). Most high complexity stimuli
were not so familiar, and so the process of decomposing them into
simpler shapes or looking for some concrete association would
have taken longer.



The Construction Component

Of the total 216 items, 144 had a construction component.
Construction component times were relatively "clean" since
rotation, if required, always followed construction. Data were
collapsed over the rotation facets and mean time and number
correct within each of these 24 cells were then used in the
analyses. Note that the construction facet has only two levels
here.

Separate analyses of variance were performed on time and
correctness (by SCORE3) and are reported in Lohman (1979b). The
major differences between the two analyses were:

1. Construction (K) was a highly significant facet in
the time analysis (F = 118.79), but not significant
in the SCORE3 analysis (F = .44). Thus, subjects
took much longer to add two stimuli to a given base
than to add one. However, they did not make
relatively more errors on the two piece additions.

2. The KP, ST, and KSTP interactions were highly
significant in the time analysis but not
significant in the SCORE3 analysis.

3. In the KS and SP interactions, the linear component
of the contrast was highly significant in one
analysis and the quadratic component significant in
the other analysis.

As in previous analyses, time and correctness were analyzed
jointly by computing the sum and difference of the independently
normalized scores. These variables were called CSUM and CDIFF.
Analyses of variance on each are reported in Tables 3.12 and
3.13. Cell means are reported elsewhere (see Lohman, 1979b).

In the CDIFF analysis, all main effects and most facet
interactions were significant, while only the NK, NP, NKP, and
NKTP subject-facet interactions were significant. On the other
hand, construction (K) and product complexity were the only
significant main effects in the CSUM analysis. Most facet
interactions were significant, but generally smaller than their
counterparts in the CDIFF analysis. Large subject-facet
interactions were obtained for K, S linear, and P.

Table 3.12
CSUM Analysis of Variance

Source                 df    MS             F          Source        df     MS             F(a)

Mean                    1    .000                      N             29     19.074
Construction (K)        1    [illegible]               NK                   [illegible]
Stimulus Complexity    (2)                             NS
  Linear                1    .994           .78          Linear      29     [illegible]    [illegible]*
  Quadratic             1    [illegible]    2.04         Quadratic                         [illegible]
Type of Addition        1    .376           .47        NT            29     [illegible]    [illegible]
Product Complexity      1    [illegible]    .12        NP                   [illegible]    [illegible]
KS                     (2)                             NKS
  Linear                1    7.057          15.86*       Linear      29     .445           .86
  Quadratic             1    .000           .00          Quadratic   29     .669           1.29
KT                      1    .143           .15        NKT           29     .938           1.81*
KP                      1    16.119         17.43*     NKP           29     .925           1.78
ST                     (2)                             NST           (58)
  Linear                1    .571           1.05         Linear      29     .545           1.05
  Quadratic             1    .721           .93          Quadratic   29     .773           1.49
SP                     (2)                             NSP           (58)
  Linear                1    13.913         16.99*       Linear      29     .819           [illegible]
  Quadratic             1    4.152          6.19         Quadratic   29     .671           1.27
TP                      1    1.860          2.99       NTP                  .022           [illegible]
KST                    (2)                             NKST
  Linear                1    .404           1.01         Linear      29     [illegible]    .77
  Quadratic             1    4.728          11.07*       Quadratic   29     [illegible]    .02
KSP                    (2)                             NKSP          (58)
  Linear                1    3.347          5.86         Linear      29     .571           1.10
  Quadratic             1    2.902          3.50         Quadratic   29     .828           1.60
KTP                     1    7.864          7.49       NKTP          29     1.050          2.02*
STP                    (2)                             NSTP          (58)
  Linear                1    .758           1.19         Linear      29     .639           1.23
  Quadratic             1    3.675          4.65         Quadratic   29     .790           1.52
KSTP                   (2)                             NKSTP         (58)
  Linear                1    4.586          9.38*        Linear      29     .489
  Quadratic             1    1.236          2.25         Quadratic   29     .549

(a) MS(e) estimated by the NKSTP interaction.
*p less than .01

Table 3.13
CDIFF Analysis of Variance

Source                 df    MS         F              Source        df     MS             F(a)

Mean                    1    .000                      N             29     11.188
Construction (K)        1    140.747    97.81*         NK            29     1.439          2.44*
Stimulus Complexity    (2)                             NS            (58)
  Linear                1    70.411     96.98*           Linear      29     .726           1.23
  Quadratic             1    5.599      11.13*           Quadratic   29     .503           .85
Type of Addition        1    42.912     53.91*         NT            29     .796           1.35
Product Complexity      1    253.171    109.17*        NP            29     2.319          3.94*
KS                     (2)                             NKS           (58)
  Linear                1    8.619      15.23*           Linear      29
  Quadratic             1    6.459      16.69*           Quadratic   29     .387           .66
KT                      1    33.391     [illegible]*   NKT           29                    [illegible]
KP                      1    8.619      8.16*          NKP           29
ST                     (2)                             NST           (58)
  Linear                1    17.511     26.57*           Linear      29     .659           1.12
  Quadratic             1    .192       .29              Quadratic   29     .666           1.13
SP                     (2)                             NSP           (58)
  Linear                1    16.251     23.52*           Linear      29     .691           1.17
  Quadratic             1    4.401      5.65             Quadratic   29     .779           1.32
TP                      1    28.227     38.61*         NTP           29     .731           1.24
KST                    (2)                             NKST          (58)
  Linear                1    13.187     24.11*           Linear      29     .547           .93
  Quadratic             1    .499       .82              Quadratic   29     .612           1.04
KSP                    (2)                             NKSP          (58)
  Linear                1    7.203      13.72*           Linear      29     .525           .89
  Quadratic             1               12.21*           Quadratic   29     [illegible]    [illegible]
KTP                     1    25.560     21.48*         NKTP          29     1.190          2.02
STP                    (2)                             NSTP          (58)
  Linear                1    1.532      2.02             Linear      29     .760           1.29
  Quadratic             1    5.217      7.46             Quadratic   29     .699           1.19
KSTP                   (2)                             NKSTP         (58)
  Linear                1    .001       .00              Linear      29     .672
  Quadratic             1    7.530      14.88*           Quadratic   29     .506

(a) MS(e) estimated by the NKSTP interaction.
*p less than .01

Construction Facet.

As previously noted, K was a powerful facet in the component
time analysis, but not significant in the SCORE3 analysis of
variance. This emerged as a large main effect in both the CSUM
and CDIFF analyses. The latter reflects the fact that two piece
additions were more difficult than one piece additions. The CSUM
main effect reflects the large changes in speed-accuracy tradeoff
over the facet. Subjects maintained a constant error rate by
taking an average of 4.56 sec longer to solve the two piece
additions.

Individual contrast scores on CDIFF correlated significantly
with Cards Wrong (.48), total correct Draw trials (.49),
Necessary Arithmetic Operations (.43), Terman Concept Mastery
(.55), WAIS Information (.48), Vocabulary (.46), and Verbal Scale
Score (.49). Thus high verbals were most drastically affected by
the shift from one to two piece additions.

The CSUM contrast scores had only one significant
correlation: .56 with Visual Number Span Longest Correct.
Perhaps those good at remembering numbers tend to be poor at
remembering figures.

Stimulus Complexity.

Both linear and quadratic stimulus complexity contrasts were
significant in the CDIFF analysis. The latter reflects a greater
increase in difficulty between medium and high than between low
and medium complexity. Neither contrast was significant in the
CSUM analysis.

Individual CDIFF linear contrast scores correlated
significantly with a cluster of tests similar to those found on
factors called Reasoning in the factor analytic literature. The
tests and their correlations were: Necessary Arithmetic
Operations (.43), WAIS Arithmetic (.50), Digit Span Backwards
(.47), and Verbal Scale Score (.45). On the other hand, the
CDIFF quadratic contrast scores correlated most strongly with
spatial reference tests. Seven of ten significant correlations
were with spatial tests. Thus, for subjects with high spatial
scores, additions involving medium complexity stimuli were only
slightly more difficult than those involving low complexity
stimuli. Additions of high complexity stimuli were much more
difficult.

For CSUM, the linear contrast scores correlated with Word
Beginnings and Endings (.55), Uses for Things (.51), and WAIS
Vocabulary (.44). Once again, large changes in speed-accuracy
tradeoff over the stimulus complexity factor are associated with
verbal ability, this time with a cluster more properly called
Word Fluency or Verbal Productive Thinking.

Type of Addition.

While the main effect indicates that adding to the left was
generally more difficult than adding to the right, the KT
interaction reveals that this difference held only for two piece
additions. Contrast scores on CDIFF for this factor correlated
-.56 with Marks Vividness of Visual Imagery Questionnaire and .56
with Visual Number Span. Thus, those who had the most difficulty
adding to the left reported poor visual imagery but had good
short term recall for numbers.

Product Complexity.

Once again product complexity emerged as a powerful factor,
both in the CDIFF analysis of variance and the CDIFF contrast
correlations. Additions that yielded high complexity products
were much more difficult than those that combined to form low
complexity products. This held for time, correctness, and
the CDIFF score.

CDIFF contrast scores correlated with complex spatial tests
(Paper Folding, Surface Development), Gf or G tests (Hidden
Figures, Necessary Arithmetic Operations, Letter Series, Terman
Concept Mastery, WAIS Block Design), and the WAIS Verbal tests.
Thus, the effect once again appears to be associated with general
ability. High ability students took more time and made more
errors on high than on low product complexity additions.

The only significant CSUM correlation was with an "internal"
variable: Total Correct Draw (r = .44). This reflects the obvious
conclusion that those who were more careful on high complexity
additions were better able to draw them when requested to do so.

The N Contrast and Cell Totals.

The CDIFF N contrast reflects subjects' relative positions on
a fast-accurate to slow-inaccurate scale. The only significant
reference test correlations for this contrast were -.44 with
Figures and -.58 with WAIS Picture Arrangement. Thus,
fast-accurate construction was associated with high scores on
these two tests.

The CSUM N contrast scores, on the other hand, correlated
with a number of reference tests. The tests and their
correlations with CSUM are shown in the first column of Table
3.14. High scores on these tests are associated with
slow-accurate responding in this task. Given the preponderance of
verbal tests in the list, it would appear that high CSUM scores
were more associated with verbal ability (or Gc) than spatial
ability (or Gf).

The N contrast scores for the independent analyses on the
construction component time and SCORE3 were even more revealing.
These contrasts represent average construction component time and
average SCORE3 for each subject. As shown in Table 3.14, average
construction time correlated with verbal tests, while average
SCORE3 correlated with spatial tests.

Mean time and SCORE3 within each of the 24 cells of the
construction component design were also correlated with the
reference tests. Six of these component times correlated with
WAIS Verbal Scale Score. All were for the more difficult cells,
i.e., high stimulus complexity (four of six), high product
complexity (all six), two piece additions (four of six), and left
additions (four of six). Cell means for the easier items had
small positive correlations with verbal tests and zero or slight
negative correlations with spatial tests. However, only two
Closure Speed test correlations were significant (see below).

Table 3.14

Correlations between Construction CSUM, Component Time,
and SCORE3 N Contrasts and Reference Tests (N = 30)

Reference Test                  CSUM           Time           SCORE3

Paper Folding                   [illegible]                   67
Form Board                                                    [illegible]
Surface Development             [illegible]                   [illegible]
Memory for Designs - Quality    58
Total Correct Draw Trials       75             51             [illegible]
Copying
Cards                                                         [illegible]
Figures                                                       [illegible]
Identical Pictures                                            [illegible]
Visual Number Span
Word Transformations
Letter Series                   50                            [illegible]
Terman Concept Mastery          [illegible]    [illegible]
WAIS Information                52             47
Digit Span Forward              44             43
Vocabulary                      48             47
Picture Arrangement                                           49
Object Assembly                                               42
Verbal Scale Score              52             45
Performance Scale Score                                       57
Full Scale Score                52                            52

Note. Decimals and correlations less than .41 omitted.

For the SCORE3 cell totals, only four of the 24 cells
correlated significantly with WAIS Verbal Scale Score. However,
these same cells also had high correlations with spatial tests;
thus the relationship is with general ability. On the other
hand, 23 cells correlated significantly with spatial tests. The
remaining three cells did not correlate with anything in the
battery.

Therefore, component latency and correctness were largely
independent aspects of performance. Long solution times on
difficult additions were associated with high verbal ability.
Latency on easy additions was unrelated to either spatial or
verbal ability. Correctness on both easy and difficult items was
associated with spatial ability.



Closure Speed.

Twenty-one of the 24 correlations between construction
component cell mean times and the Closure Speed reference factor
were negative. However, only two were significant. The only
design facet shared by the ten highest correlations (r = -.30 to
-.60) was low product complexity. None of the SCORE3 cell total
correlations with the Closure Speed factor were significant (in
fact, only two were above .17). Thus, as argued elsewhere
(Lohman, 1977b; 1979), Closure Speed is not a separate cognitive
skill, but merely one aspect of spatial ability. Further, these
results support the argument that the factor would be better
represented by latency than the usual index of total correct
within a given time limit.

The Rotation Component

Of the 216 items, 144 required rotation and thus had a
rotation component time. Since construction always preceded
rotation, rotation component times for the 96 items that also
required construction were not experimentally independent of the
construction operations. Rotation time on these items depended
not only on the success of the rotation, but also on the success
of the preceding addition.

Separate analyses were performed on the 48 items where
rotation was not preceded by construction and on the 96 items
requiring both construction and rotation. Details of the latter
analysis are presented in Lohman (1979b), and will be reviewed
only briefly here.



Ha Construction Items , 

Analyses of variance on rotation time and SC0RE3 for the 48 
no construction items are shown in Tables 3.15 and 3.16. 
Separate analyses were also performed on the sum and difference 
of normalized rotation time and correctness scores. However, 
since there were no discrepancies between the rotation time and 
SC0RE3 analyseSf the sum and difference scores provided no unique 
information. The difference score analysis coincided with those 
reported in Tables 3.15 and 3.16, while neither facets nor • 
subject-facet interactions were significant in the sum score 
analysis . 

Both rotation time and SCORE3 showed large main effects for 
rotation, no main effect for stimulus complexity, and a small, 
but significant rotation by stimulus complexity interaction. 
There were also large practice effects. Both rotation time and 
errors decreased linearly over the four blocks. Figure 3.6a 
shows a substantial decrease in rotation time over blocks, but 
only a slight increase in slopes. As is shown in Figure 3.6b, 
the decrease in errors was much larger for 180 degree rotations 
than for 90 degree rotations. 

Although the rotation by stimulus complexity interaction was 
significant in both analyses, it is probably not reliable. The 
interaction for rotation time is plotted separately for each 
block in Figure 3.7. The interaction was different in each 
block, and significant only in the first two blocks. 



Correlations. 

If nothing else, Figure 3.7 reveals that what was happening 
in Block 1 is not the same as what was happening in Blocks 3 and 
4. This suggests that practice alters the relationships between 
rotation times and scores, and external variables. 

Table 3.15 

Analysis of Variance for Rotation Component Time 

Source                  df        MS         F     Source         df      MS    F(a)
Mean                     1   3162.33   195.34*     N              29   16.18       -
Rotation                       43.27    35.41*     NR             29    1.22    1.54
Stimulus Complexity    (2)                         NS           (58)
  Linear                         .05       .07       Linear       29     .83    1.05
  Quadratic                     1.44      3.43       Quadratic    29     .42     .53
RS                     (2)                         NRS          (58)
  Linear                        1.99      2.11       Linear       29     .94
  Quadratic                     7.81    12.24*       Quadratic    29     .63

(a) MS(e) estimated by the NRS interaction. 
*p less than .01 

Table 3.16 

Analysis of Variance for Rotation SCORE3 

Source                  df        MS         F     Source         df      MS    F(a)
Mean                     1   110.646  1121.19*     N              29    .099
Rotation                        .176    11.15*     NR             29    .016     .81
Stimulus Complexity    (2)                         NS
  Linear                        .038      2.38       Linear       29    .016     .89
  Quadratic                     .002       .11       Quadratic    29    .019     .81
RS                     (2)                         NRS          (58)
  Linear                        .022      1.01       Linear       29    .022
  Quadratic                     .207    11.96*       Quadratic    29    .017

(a) MS(e) estimated by the NRS interaction. 
*p less than .01 

Figure 3.7. Plots of the rotation by stimulus complexity interaction, by block. 

Correlations between mean SCORE and rotation time within 
each block, and selected reference variables are shown in Table 
3.17. SCORE had its maximum correlations in Blocks 3 and 4, 
while rotation time correlations were highest in Blocks 1 and 2. 
While SCORE (and SCORE3) had many significant correlations with 
reference tests, mean rotation time correlated significantly only 
with number of errors on the Cards test. 

When collapsed over blocks, number of correct 180 degree 
rotations correlated slightly higher with reference tests than 
the number of correct 90 degree rotations. For example, 
correlations with Paper Folding were .46 and .53, and with 
Surface Development ... and .54, for 90 and 180 degree rotations, 
respectively. 

Once again, steep slopes were related to verbal ability. 
The rotation time contrast scores (180 time minus 90 time) 
correlated with WAIS Comprehension (.44), Digit Span (.50) and 
Verbal Scale Score (.46). For SCORE3, scores on the rotation 
contrast were positively correlated with two visual memory tests: 
Film Memory III (r=.53) and Visual Number Span (r=.51). Since 
Visual Number Span usually falls with the verbal tests or verbal 
strategy indices, and has frequently opposed the Marks Vividness 
of Visual Imagery Questionnaire, the interpretation of these 
correlations is uncertain. Those who did not show a large 
decline in correctness between 90 and 180 degree rotations may 
have had good visual memory, and relied on it to rotate the 
entire stimulus, ignoring other cues that would make the task 
easier but more error prone. 

The linear stimulus complexity scores for the rotation time 
contrast correlated .55 with Digit Span. Similarly, for SCORE3, 
the linear stimulus complexity contrast correlated -.52 with 
Digit Span and -.43 with Uses for Things. Thus, those who took 
longer and missed more high stimulus complexity rotations than 
low complexity rotations were probably attempting to encode and 
remember the stimulus or its distinctive features verbally. 
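
The contrast scores correlated above can be illustrated with a short sketch. It assumes that each subject's contrast score is a weighted sum of that subject's cell means over the levels of a facet, using conventional orthogonal weights for three levels. The weights, function names, and all data values are hypothetical and are not taken from the study.

```python
from math import sqrt

# Conventional orthogonal polynomial weights for three ordered levels
# (low, medium, high). Assumed here; the text does not list its weights.
LINEAR = (-1, 0, 1)
QUADRATIC = (1, -2, 1)


def contrast(cell_means, weights):
    """Weighted sum of one subject's cell means over the levels of a facet."""
    return sum(w * m for w, m in zip(weights, cell_means))


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Hypothetical rotation times (sec) at low, medium, and high stimulus
# complexity for four subjects, with hypothetical Digit Span scores.
rotation_times = [(2.0, 2.1, 2.2), (2.0, 2.5, 3.0), (1.8, 2.0, 2.6), (2.2, 2.2, 2.3)]
digit_span = [5, 9, 8, 6]

linear_scores = [contrast(t, LINEAR) for t in rotation_times]
r = pearson_r(linear_scores, digit_span)
```

A two-level contrast such as "180 time minus 90 time" is the same computation with weights (-1, 1). A large positive `r` here would mean that subjects with steeper increases over complexity also had higher reference test scores.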

Table 3.17 

Correlations between Mean SCORE, Mean Rotation Component 
Time and Reference Tests, by Block (N=30) 

                                         SCORE
Reference Test               Block 1   Block 2   Block 3   Block 4
WAIS IQ                         10        21        44*       26
Paper Folding                   22        34        64*       44*
Surface Development             31        32        61*       35
Cards                           02        26        39        44*
Figures                         02        18        22        45*
Total Correct Draw Trials       28        43*       62*       41

                                     ROTATION TIME
Reference Test               Block 1   Block 2   Block 3   Block 4
WAIS IQ                         23        25       -13       -03
Cards Wrong                     43*       55*       24        18
Identical Pictures              31        30        17        12
Total Correct Draw Trials       42*       44*       17        19

Note. Decimals omitted. r = .42 significant at .01 



Rotation of Constructed Stimuli. 

The major finding here was that it took longer to rotate a 
constructed stimulus. Figure 3.8 shows the plot of rotation time 
for each level of the construction facet. Product complexity did 
not affect rotation time, but did, of course, have large effects 
on error rate. Stimulus complexity affected both error rate and 
rotation time, especially for constructions involving high 
complexity stimuli. However, the previously noted limitations of 
these data make interpretations risky. 



Since accepting or rejecting the test stimulus was the last 
component operation in every item, match component times depend 
on the success of all previous operations. As such, they are the 
most difficult component times to interpret. 

Separate analyses were performed on the 24 match times 
preceded only by encoding and all 216 match times. Although 
analyses of variance were performed on match times and on the sum 
and difference of separately normalized match times and a 
correctness index, only descriptive statistics and correlational 
results are summarized here. Details of the analyses of variance 
are reported in Lohman (1979b). 

All Match Times. Subjects were first divided into three groups on 
the basis of total SCORE3 over all 216 items. Mean match time 
was then computed for each design facet separately for correct 
and incorrect responses for each group. 
The major findings were: 

1. Match times were shorter for correct responses than 
incorrect responses. Mean times were 1.13 and 1.76 
sec for correct and incorrect responses, 
respectively. This is exactly the opposite of the 
results for the simple RT control. There, errors 
were consistently faster than correct responses. 

2. Mean match time decreased linearly over the three 
groups, both for correct and incorrect responses. 
Further, incorrect match times were longer than 
correct times in all three groups. These data are 
shown in Figure 3.9. 

3. There was a substantial interaction between 
correctness of the test probe and correctness of 
the response. These data are plotted in Figure 
3.10a. "Yes" responses were faster on correct 
items, and "no" responses were faster on incorrect 
items. These differences were strongest in the low 
and medium groups. 

4. Cell means for the rotation facet are plotted in 
Figure 3.10b. All three groups showed an increase 
in match time on rotation items for incorrect 
responses. However, only the low group showed a 
consistent increase for correct responses. In 
fact, both correct and incorrect match times 
increased linearly over levels of rotation in the 
low group. In the high group, on the other hand, 
correct match times were consistently rapid. 

5. For construction, however, match times for all three 
groups were greater for construction than no 
construction items. As shown in Figure 3.10c, the 
most pronounced difference between groups was the 
magnitude of the increase between no construction 
and one piece additions. 

6. Match times for correct responses increased 
linearly over both the stimulus complexity and type 
of addition facets. However, the increases were 
all less than one second. 

7. Product complexity had large and consistent effects 
for all three groups. These means are plotted in 
Figure 3.10d. 

Correlations. 

Match time was the only component time that correlated 
significantly with spatial tests. Separate correlations with the 
reference tests were computed for mean match time over all 216 
items and mean match time over the 24 no construction, no 
rotation items. Four correlations were significant in one or 
both analyses. These correlations are shown in Table 3.18. The 
surprise here was that match time in the no construction, no 
rotation cells correlated with the reference spatial tests as 
well as overall match time. The mean error rate for the 24 no 
construction, no rotation items was only 4.4 percent, or one 
item. Thus, the correlation is not an artifact of the 
differential match times associated with correctness. Further, 
neither mean encoding time, mean total time (encoding time plus 
match time) nor total correct over these 24 items correlated 
significantly with any of the reference tests. 

Figure 3.10. Match time for correct and incorrect items for (1) low, 
(2) medium, and (3) high performing groups, over (a) match, 
(b) rotation, (c) construction, and (d) product complexity. 

It was hypothesized that match times or total time on these 
simple no construction, no rotation items might be related to one 
or more of the perceptual speed tests, particularly Identical 
Pictures. However, mean match time on these items correlated 
only -.33, and total time -.02 with Identical Pictures. 



Summary of Match Component Analyses. 

Those who did not perform well on this task showed large 
increases in correct match times between no construction and 
construction items, and a linear increase in correct match time 
over the rotation facet. Those who performed well showed very 
little increase over the construction facet and no increase over 
rotation in correct match time. This could indicate that low 
ability subjects had not completed the construction or rotation 
operations before advancing to the match frame. 

Two pieces of evidence argue against this interpretation. 
First, although construction times increased dramatically between 
one and two piece additions, match times did not. In fact, in 
the low performance group, mean correct match time was the same 
at levels two and three of the construction facet. On the other 
hand, mean correct match times increased linearly over the 
rotation facet within this low performance group. If these 
subjects were advancing to the next frame before completing the 
requested construction or rotation operation, then they did so 
differentially for construction and rotation. 

Table 3.18 

Correlations between Mean Match Times and Reference Tests (N=30) 

Reference Test           216 Items   24 Items
Form Board                  -40        -44
Surface Development         -47        -39
Memory for Designs          -41        -48
Figures                     -46        -43

Note. Decimals omitted. 

Second, the highest correlation between mean match time over 
the 24 no construction, no rotation items and reference tests was 
-.48 with Memory for Designs. The crucial requirements of this 
test are to remember and reproduce the design in its correct 
proportions. Those who have difficulty retaining a given design 
in its correct proportions should have even greater difficulty 
keeping a constructed or transformed (i.e., rotated) figure in 
its correct proportions. The comparison and decision processes 
should take longer for these distorted images than for the 
relatively clear images where no construction or rotation was 
involved. 

Smith (1964) has long argued that the essence of spatial 
ability is the ability to retain a stimulus in its correct 
proportions. The individual differences in match times suggest 
that this may indeed be an important aspect of spatial ability. 

It must be emphasized, however, that the spatial skills 
tapped by the match times were largely independent of the level 
scores on this task. The correlation between mean match time 
over the items and total SCORE was only -.24. Thus, while 
individual differences in match times were related to spatial 
ability, the basic hypothesis of speed-level independence remains 
unchallenged. 

VI. Comparison of Overall and Component Analyses 
This section compares and summarizes the major results of 
the total time and correctness, and component time and 
correctness analyses. 

Total Scores 

Individual differences in total correct (by SCORE3) were 
strongly correlated with spatial ability. Total time, on the 
other hand, was independent of spatial ability and weakly related 
to verbal ability. 

Generally fast accurate performance on this task was 
correlated with only one reference test, WAIS Picture 
Arrangement. The tendency to opt for accuracy instead of speed 
was related to general ability. 

Means Analysis. 

Encoding component time increased linearly over stimulus 
complexity and decreased linearly over the four blocks of test 
trials. Individual differences in mean encoding time were highly 
correlated with the intercept parameters from the regressions of 
total and correct times on percent failing each item. Further, 
individual differences in mean encoding time were independent of 
the slopes, predicted times at maximum difficulty, and all of the 
reference tests, except number of pencil marks made on the Form 
Board test. 

Contrast Correlations. 

Relatively large increases in encoding time over the stimulus 
complexity facet, especially between medium and high complexity, 
were associated with verbal ability and memory span. It was 
suggested that the reported strategies of attempting to remember 
a complex initial stimulus by viewing it as a combination of 
simpler forms, or attempting to associate the stimulus with a 
concrete image would result in relatively long encoding times for 
high complexity stimuli. 

Change in encoding time over blocks was not related to the 
reference tests or overall performance on the experimental task. 



Means Analysis. 

Two piece additions took much longer than one piece 
additions, but did not result in more errors. On all other 
facets related to construction, error rate and total or component 
time increased together. The most difficult constructions were 
left additions of complex stimuli that combined to form high 
complexity products. Left additions were more difficult than 
right additions only when two pieces had to be added to the base. 
Left additions that resulted in high complexity products were 
more difficult than corresponding right additions. Difficulty 
increased nonlinearly over stimulus complexity, with the 
difference between high and medium complexity larger than the 
corresponding difference between medium and low complexity 
stimuli. The largest changes in difficulty were captured by the 
product complexity facet, with high complexity products much more 
difficult to construct than low complexity products. 

Contrast Correlations . 

N Contrasts and Cell Totals. Mean construction component 
time correlated significantly with six verbal tests. Mean 
construction time on easy items was independent of the reference 
tests, while mean time on difficult items was highly correlated 
with verbal ability. Total SCORE3, on the other hand, was 
strongly related to performance on spatial tests, both for easy 
and difficult items. 

Construction (C). In both the total time and construction 
component analyses, steep slopes were related to verbal ability. 
Further, the quadratic contrast in the total time analysis 
correlated positively with reported vividness of visual imagery 
and negatively with Hidden Figures. The linear SCORE3 
construction contrast scores correlated positively with Form 
Board and Object Assembly, while the quadratic contrast scores 
had equally large, but negative correlations with these two 
tests. Thus, high scores on the Form Board and Object Assembly 
reference tests were related to a linear decline in performance 
over the three levels of the construction facet. 

For the combined latency and correctness indices, larger 
increases in difficulty between one and two piece additions than 
between zero and one piece additions were, once again, related to 
Form Board and Object Assembly. However, magnitude of the simple 
difference between difficulty of one and two piece additions was 
related to verbal ability. 

Shifting to slower, more accurate performance between levels 
one and three was associated with verbal ability, while 
relatively larger shifts between one and two piece additions than 
between zero and one piece additions were negatively related to 
spatial ability. Thus, high spatial ability subjects showed the 
greatest change in speed-accuracy tradeoff between no addition 
and one addition items, rather than between one and two piece 
additions. 

Stimulus Complexity. In both the total time and construction 
component analyses, the linear stimulus complexity contrast was 
related to verbal ability. 

The quadratic contrast was strongly related to verbal, verbal 
reasoning, and perceptual speed tests in the total time analysis, 
but more related to general ability in the construction component 
time analysis. 

Both linear and quadratic SCORE3 stimulus complexity 
contrasts were independent of the reference tests in the overall 
analysis, but weakly related to Copying and Figures in the 
analysis of the 144 construction items. 

For the composite difficulty index, those who took much 
longer on constructions involving high complexity stimuli than on 
those using low complexity stimuli scored higher on a cluster of 
numerical or reasoning tests. A relatively larger increase in 
difficulty between medium and high complexity than between low 
and medium complexity was related to both spatial ability and 
memory span. 

Strong shifts to slower, more accurate responding between low 
and high stimulus complexity additions were associated with high 
scores on word fluency tests. 

Type of Addition. In both total time and construction 
component analyses, taking longer on left additions than right 
additions was related to good performance on Object Assembly and 
verbal tests. Further, those who missed many more left additions 
than right additions had high scores on the Visual Number Span 
test, while those whose performance was about the same on both 
reported strong visual imagery. 

For the combined latency and correctness indices, more errors 
and longer latencies on left additions were even more strongly 
associated with high scores on Visual Number Span and low scores 
on the Marks Vividness of Visual Imagery Questionnaire. Switching 
to slower, more accurate responding on left additions was 
indicative of high scores on Form Board and Object Assembly. 

Product Complexity. In both the total time and construction 
component time analyses, taking longer on high product complexity 
additions was strongly related to general ability. Time taken on 
low product complexity additions was also negatively correlated 
with Closure Speed. On the other hand, making relatively more 
errors on high complexity additions was weakly related to verbal 
ability. 

The combined latency and correctness difficulty indices were 
also strongly related to general ability in both analyses. The 
magnitude of the speed-accuracy tradeoff, on the other hand, was 
uncorrelated with the reference tests and only weakly related to 
an "internal" variable, total number of correct draw trials. 

Rotation 

Means Analysis. 

Total time increased nonlinearly while correctness decreased 
linearly over the rotation facet. The combined difficulty index, 
however, showed a perfectly linear increase. Further, there was 
a general shift to slower, more careful responding when rotation 
was required. 

Both rotation time and errors decreased linearly over the 
four blocks. The decrease was largest for errors on 180 degree 
rotations. 

Although the interaction between stimulus complexity and 
rotation was significant in both the component time and SCORE3 
analyses, the interaction was different in each block and 
significant only in the first two blocks. 

Rotation component time increased linearly over the 
construction facet. However, interpretation is difficult since 
construction always preceded rotation. 

Contrast Correlations . 

N Contrast. Mean rotation time correlated significantly with 
number of errors on the Cards test. When analyzed by block, the 
correlation was significant only in the first block. Total 
number correct on rotation only items correlated significantly 
with spatial ability, particularly in blocks three and four. The 
highest correlations in block four were with Figures, Cards, and 
Paper Folding. Number of correct 180 degree rotations correlated 
slightly higher with spatial reference tests than number of 
correct 90 degree rotations. 

Rotation. Steep slopes were associated with Cards Wrong in 
the total time analysis (R = 0, 90, 180), and verbal ability and 
Digit Span in the component time analysis (R = 90, 180). A 
linear decline in correctness was related to high scores on 
Figures and Cards in the overall analysis (R = 0, 90, 180), and 
high scores on two visual memory tests in the analysis of the 48 
rotation only items (R = 90, 180). 

Stimulus Complexity. A linear increase in rotation time and 
a linear decrease in correctness over stimulus complexity were 
related to high scores on memory span tests, particularly WAIS 
Digit Span Backwards. Further, those who made relatively more 
errors on rotations of high complexity stimuli performed well on 
Form Board, Object Assembly, and Memory for Designs. 

Match 

Means Analysis . 

Match was the only nonsignificant main effect in the analysis 
of total time. This was the result of a substantial interaction 
between correctness of the test probe and correctness of the 
response. In particular, "yes" responses were faster on correct 
items and "no" responses faster on incorrect items. Those who 
performed well on the experimental task showed no increase in 
match time over the rotation facet and a small increase between 
construction and no construction items. Those who performed 
poorly showed a linear increase in match time over the rotation 
facet and a large increase in match time on construction items. 
Both correct and incorrect match times were consistently longer 
for high than for low product complexity stimuli. 

Correlations. 

For the match facet in the total time analysis, faster 
responses to positive test stimuli were related to high scores on 
the Form Board and Picture Completion tests. Faster responses to 
negative test stimuli were associated with more errors on the 
Cards test and higher scores on Visual Number Span. 

Mean match component time was the only component time that 
correlated significantly with spatial ability. These 
correlations were the same for all 216 items and the 24 no 
construction, no rotation items. However, mean match time was 
not significantly correlated with level scores on the 
experimental task. 



The results of this experiment have several implications for 
research on aptitude processes. 

The first important finding was that individual differences 
in speed of solving simple items were independent of level scores 
on the experimental task and reference tests. Thus, it is 
unlikely that a process understanding of spatial ability will be 
obtained by modeling individual differences in speed of solving 
simple spatial tasks. 

The second important finding was that individual differences 
in the speed of solving complex tasks were consistently related 
to verbal reference tests. The implication here is that at least 
some subjects solved complex items using verbal-analytic 
strategies. This does not mean that all subjects used nonspatial 
strategies on complex items, or that individual differences in 
the speed of solving complex items are generally unrelated to 
spatial ability. The most likely possibility is that some 
subjects solve complex items on spatial tests using spatial 
strategies, others use verbal-analytic strategies, while most use 
both spatial and verbal processes. The high positive 
correlations between solution latency for complex items and 
verbal reference tests probably result from greater variance in 
solution latency for those items solved by verbal-analytic 
strategies. This greater variance in latency in the nonspatial 
group would completely overshadow the much smaller variation in 
latency scores in the spatial group. Correlations for the total 
group would then tend to reflect the correlation in the verbal 
and mixed strategy groups. 
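
The variance argument above can be checked with a small numerical sketch. All values below are invented for illustration and are not data from the study. The sketch assumes two equally sized strategy groups with the same spread of verbal scores, and the helper `pearson_r` is a hypothetical name.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Hypothetical verbal reference scores, identical in both groups.
verbal = [1, 2, 3, 4, 5]

# Spatial-strategy group: latencies vary little and do not track verbal score.
latency_spatial = [10.1, 9.9, 10.0, 10.1, 9.9]

# Verbal-analytic group: latencies vary widely and rise with verbal score.
latency_verbal = [4.0, 8.0, 12.0, 16.0, 20.0]

r_spatial = pearson_r(verbal, latency_spatial)
r_verbal = pearson_r(verbal, latency_verbal)
r_pooled = pearson_r(verbal + verbal, latency_spatial + latency_verbal)
```

With these made-up numbers the pooled correlation is strongly positive even though the spatial group shows no positive relation, because nearly all of the latency variance comes from the verbal-analytic group.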

The third important result was that level scores on this task 
were highly correlated with scores on a number of complex spatial 
tests in the reference battery. Thus, the findings are not 
limited to this particular task. This suggests that these 
reference spatial tests also can be solved in more than one way. 
Those who have studied retrospective and introspective reports of 
solution strategy on spatial tests have reached a similar 
conclusion (see Lohman, 1979a for a review). In short, the major 
challenge for any spatial test, whether simple or complex, is to 
keep it spatial. 

There are two additional implications of the "nonspatial" 
nature of most spatial tests. First, verbal and spatial 
abilities may be more independent than the literature on mental 
tests suggests. Second, if some subjects solve items on a 
carefully constructed spatial test or experimental task using 
nonspatial strategies, then perhaps spatial ability is not that 
important in the real world (cf. Cronbach, 1970), except, of 
course, when solution time is a crucial factor (e.g., piloting an 
airplane (Guilford & Lacey, 1947)). It is also possible that 
spatial ability tests would have higher validity coefficients if 
they could be made truly spatial. One simple way to do this 
would be to utilize both error and latency data, as in the SCORE3 
index used in this study. 

If some subjects solved some items on this experimental task 
using nonspatial strategies, then the obvious next question is 
who was using these nonspatial strategies, and on what items? 
Unfortunately, the experimental task was not designed to answer 
these questions. In the present experiment, the predicted 
patterns for latency and error data over the design facets are 
the same for both spatial and verbal-analytic solution 
strategies. What is needed is an experimental task for which 
different solution strategies would yield qualitatively 
different patterns of scores over specified design facets. 

The most likely candidates for this type of experiment are 
the first and last component processes, namely encoding and 
discriminative response (or match). The encoding phase is 
important because the nature of the internal representation of 
the stimuli may determine, predispose, or at least limit the type 
of transformational processes that may operate on it. Thus, the 
subject who "burns in" a holistic representation is more likely 
to rotate the entire stimulus than one who encodes the same 
stimulus as "a rectangle with a small triangle cut out of the 
right side." 

At the other end of the item, individual differences in 
component latencies for the discriminative response were the only 
latency measures that correlated with spatial ability. Further, 
scoring items with abnormally long discriminative response times 
incorrect was the major factor in making SCORE3 a "cleaner" 
measure of spatial ability than total number correct (SCORE). 
Cooper (1976) has also found important individual differences in 
discriminative response time on spatial tasks. 

For both encoding and discriminative response component 
times, the crucial factor appears to be the nature of the 
internal representation the subject is either constructing or 
comparing with the test stimulus. Subjects who attempted to 
decompose complex stimuli into more familiar, labelable pieces 
took longer to encode, add, rotate, and compare the result with a 
test stimulus. 

Another result that underscores the importance of the 
internal representation was that individual scores for the 
product complexity contrast were the best predictors of level 
scores on the experimental task. The interesting aspect of this 
facet is that it is a characteristic of the internal 
representation subjects are assumed to construct, and is 
independent of the complexity of the stimuli the subject actually 
sees. Again, however, subjects using both verbal-analytic and 
spatial strategies should show effects in the same direction over 
the facet. 

This study has also pointed to some important methodological 
considerations for research on aptitude processes. First, it 
underscores the important differences between individual 
differences in speed and level. Inferences about the nature of 
general aptitude constructs based on experiments that model 
latency for relatively error free tasks are suspect. 

Second, the method of computing sum and difference scores for 
the joint analyses of error and latency data, while superior to 
other methods that have been proposed (Pachella, 1974; Sternberg, 
1977), is still inadequate. When a subject fails to solve an 
item, the reaction time for that item is ambiguous. Statistical 
manipulations cannot erase the psychological ambiguity of the 
data. A retreat to simple tasks and error free performance is 
not the answer, as this study has amply demonstrated. The 
important methodological realization is that models that account 
for performance on complex items are necessarily models for high 
ability subjects. Ability is confounded with item complexity 
because it is defined by the maximum difficulty of the items the 
subject can solve. Experimentally, then, a tailored testing 
approach should prove more useful than the typical procedure 
where all subjects attempt all items. The motivation of the low 
ability subjects should also improve since they would not be 
continually confronted with items that they cannot solve. 
Rectangular rather than representative sampling of subjects could 
then be used to ensure the same number of high, medium, and low 
ability subjects in the design. Low ability subjects would 
provide unambiguous latency data for easy items, medium ability 
subjects for easy and medium difficulty items, and high ability 
subjects for easy, medium, and difficult items. The analysis 
could then contrast the way low, medium, and high ability 
subjects solve easy items, the way medium and high ability 
subjects solve medium difficulty items and the way high ability 
subjects solve difficult items. Contrasts within an ability 
group could also be made, such as between the models that best 
fit the performance of high ability subjects on easy, medium, and 
difficult items. 
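
The tailored design proposed above can be laid out as a small lookup. This is a sketch only: the three-by-three structure follows the paragraph, while the names (`items_for`, `groups_for`) and the use of plain lists are illustrative assumptions.

```python
# Ordered levels, from lowest to highest, as described in the text.
DIFFICULTY_ORDER = ["easy", "medium", "difficult"]
ABILITY_ORDER = ["low", "medium", "high"]


def items_for(ability):
    """Item difficulty levels attempted by subjects of a given ability.

    Each group attempts every level up to and including its own, so that
    its latencies come from items it can actually solve.
    """
    k = ABILITY_ORDER.index(ability)
    return DIFFICULTY_ORDER[: k + 1]


def groups_for(difficulty):
    """Ability groups that supply unambiguous latency data for a difficulty level."""
    return [a for a in ABILITY_ORDER if difficulty in items_for(a)]


# Which items each ability group attempts.
design = {a: items_for(a) for a in ABILITY_ORDER}

# Which ability groups can be contrasted at each difficulty level.
between_group_contrasts = {d: groups_for(d) for d in DIFFICULTY_ORDER}
```

Read this way, `between_group_contrasts` lists the between-group comparisons available at each difficulty level, and `design["high"]` lists the within-group comparisons available for high ability subjects.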

Another implication for future research stems from the 
conclusion that subjects solve different items in different ways. 
The analysis must go beyond fitting one model for each subject. 
It must focus on the way a particular subject solves a particular 
item type. Level of practice, item difficulty, and 
susceptibility of the item to alternative solution strategies are 
three possible blocking factors for such a design. 
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CHAPTER 11 

SUMMARY 

Previous research on the nature of aptitude processes has 
ignored the important differences between speed and level of 
performance. However, investigations of the relationship between 
individual differences in speed and level suggest that speed of 
solving simple spatial tasks is largely independent of level 
attained on more complex spatial tasks. 

The present experiment was designed to investigate the 
relationships between speed, level, and complexity in individual 
differences in spatial ability. Thirty male high school and 
college students were selected to represent a wide range of 
individual differences in verbal ability and spatial ability. 
Subjects were required to match, rotate, combine, and combine and 
then rotate three to eight point polygons. Three levels of 
construction (one, two, or three stimuli), three levels of 
stimulus complexity (low, medium, or high), three levels of 
rotation (0, 90, or 180 degrees), and two types of discriminative 
response (correct or incorrect) were fully crossed in the design. 
Additionally, type of addition (left or right) and complexity of 
the to-be-constructed stimuli were nested within levels of the 
construction facet but crossed with each other and all other 
facets. Since each subject was administered all 216 items, 
subjects were the seventh fully crossed factor. 

Thirty trials on which the subject was asked to draw the 
final stimulus rather than merely to recognize it were randomly 
interspersed with the test trials. A simple reaction time 
control condition preceded the experiment. 

Correctness, confidence, and from two to four component times 
were obtained on each item. Total time was determined by summing 
the component times. These data were analyzed using regression, 
analysis of variance, and correlational techniques. 

In the regression analyses, latency measures were regressed 
on a group difficulty scale. Correlations between intercepts, 
slopes, predicted times at maximum difficulty, and level scores 
and reference constructs supported the basic hypothesis of 
speed-level independence. In particular, individual differences in 
speed of solving simple items were independent of both level on 
the task and reference abilities. Speed of solving complex items 
was strongly correlated with level on the task, but more highly 
correlated with verbal than spatial reference constructs. 
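
As a rough illustration of the regression step, the sketch below 
fits an ordinary least squares line to one subject's item 
latencies against a group difficulty scale (here taken to be the 
percent of the sample failing each item). The function names are 
hypothetical, and the choice of the largest observed difficulty 
as the "maximum difficulty" is an assumption; the original 
analysis may have defined it differently. 

```python
# Hypothetical sketch: per-subject regression of latency on item difficulty.
# Names and the definition of "maximum difficulty" are assumptions.


def fit_line(x, y):
    """Ordinary least squares fit of y = intercept + slope * x."""
    n = len(x)
    if n < 2 or n != len(y):
        raise ValueError("need two or more paired observations")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    if sxx == 0:
        raise ValueError("difficulty values must vary")
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


def speed_parameters(difficulty, latency, max_difficulty=None):
    """Intercept, slope, and predicted time at maximum difficulty.

    difficulty: percent of the sample failing each item.
    latency: this subject's time on the same items.
    """
    intercept, slope = fit_line(difficulty, latency)
    if max_difficulty is None:
        # Assumption: use the hardest item actually observed.
        max_difficulty = max(difficulty)
    return {
        "intercept": intercept,  # speed on the simplest items
        "slope": slope,          # cost of added difficulty
        "time_at_max": intercept + slope * max_difficulty,
    }


if __name__ == "__main__":
    print(speed_parameters([0, 10, 20, 30], [2.0, 3.0, 4.0, 5.0]))
```

The three values returned for each subject correspond to the 
intercepts, slopes, and predicted times at maximum difficulty 
that were then correlated with level scores and reference tests. 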

In the means analysis, repeated measures analyses of variance 
were performed on total time and correctness, component times and 
correctness, and combined indices representing the sum and 
difference of independently standardized latency and correctness 
scores. These combined analyses were performed on both total 
time and correctness as well as each component time and its 
correctness index. 
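
A minimal sketch of the combined indices follows. It assumes, 
following the description given in the conclusions, that the 
difference of the standardized scores serves as a composite 
difficulty index and the sum as a speed-accuracy tradeoff index. 
The order of subtraction (time minus correctness), the use of the 
population standard deviation, and all function names are 
assumptions of this sketch. 

```python
# Hypothetical sketch of the sum and difference indices.
# Subtraction order and the use of pstdev are assumptions.
from statistics import mean, pstdev


def standardize(values):
    """Rescale values to zero mean and unit variance."""
    m = mean(values)
    s = pstdev(values)
    if s == 0:
        raise ValueError("values must vary to be standardized")
    return [(v - m) / s for v in values]


def combined_indices(times, correctness):
    """Return (difficulty, tradeoff) lists for paired observations.

    difficulty: z(time) - z(correctness); large when a cell is
        both slow and error prone.
    tradeoff: z(time) + z(correctness); large when a cell is
        slow but accurate.
    """
    if len(times) != len(correctness):
        raise ValueError("times and correctness must be paired")
    z_time = standardize(times)
    z_correct = standardize(correctness)
    difficulty = [t - c for t, c in zip(z_time, z_correct)]
    tradeoff = [t + c for t, c in zip(z_time, z_correct)]
    return difficulty, tradeoff


if __name__ == "__main__":
    difficulty, tradeoff = combined_indices([1.0, 2.0, 3.0], [3.0, 2.0, 1.0])
    print(difficulty)
    print(tradeoff)
```

In this toy example time rises exactly as correctness falls, so 
all of the variation loads on the difficulty index and the 
tradeoff index is zero throughout. 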

Relationships between individual differences in total scores, 
cell totals, and the pattern of cell means over design facets 
were explored in the correlational analyses. In general, 
correlations between correctness indices and reference tests 
supported the construct validity of the task and its facets, 
while correlations between latency indices and reference tests 
suggested more about the processes or strategies subjects were 
using to solve the items. 

The major results of the means analyses and correlational 
analyses are summarized below. 



Individual differences in total correct (by SCORES) were 
strongly correlated with spatial ability. Total time, on the 
other hand, was independent of spatial ability and weakly related 
to verbal ability. 

Generally fast accurate performance on this task was 
correlated with only one reference test, WAIS Picture 
Arrangement. The tendency to opt for accuracy instead of speed 
was related to general ability. 



Means Analysis. 

Encoding component time decreased linearly over stimulus 
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complexity and the four blocks of test trials. Individual 
differences in mean encoding time were highly correlated with the 
intercept parameters from the regressions of total and correct 
times on percent failing each item. Further, individual 
differences in mean encoding time were independent of the slopes, 
predicted times at maximum difficulty, and all of the reference 
tests, except number of pencil marks made on the Form Board test. 

Contrast Correlations. 

Relatively large increases in encoding time over the stimulus 
complexity facet, especially between medium and high complexity, 
were associated with verbal ability and memory span. It was 
suggested that the reported strategies of attempting to remember 
a complex initial stimulus by viewing it as a combination of 
simpler forms, or attempting to associate the stimulus with a 
concrete image, would result in relatively long encoding times for 
high complexity stimuli. 

Change in encoding time over blocks was not related to the 
reference tests or overall performance on the experimental task. 



Means Analysis. 

Two piece additions took much longer than one piece 
additions, but did not result in more errors. On all other 
facets related to construction, error rate and total or component 
time increased together. The most difficult constructions were 
left additions of complex stimuli that combined to form high 
complexity products. Left additions were more difficult than 
right additions only when two pieces had to be added to the base. 
Left additions that resulted in high complexity products were 
more difficult than corresponding right additions. Difficulty 
increased nonlinearly over stimulus complexity, with the 
difference between high and medium complexity larger than the 
corresponding difference between medium and low complexity 
stimuli. The largest changes in difficulty were captured by the 
product complexity facet, with high complexity products much more 
difficult to construct than low complexity products. 

Contrast Correlations. 

M Contrasts and Cell Totals. Mean construction component 
time correlated significantly with six verbal tests. Mean 
construction time on easy items was independent of the reference 
tests, while mean time on difficult items was highly correlated 
with verbal ability. Total SCORE3, on the other hand, was 
strongly related to performance on spatial tests, both for easy 
and difficult items. 

Construction. In both the total time and construction 
component analyses, steep slopes were related to verbal ability. 
Further, the quadratic contrast in the total time analysis 
correlated positively with reported vividness of visual imagery 
and negatively with Hidden Figures. The linear SCORE3 
construction contrast scores correlated positively with Form 
Board and Object Assembly, while the quadratic contrast scores 
had equally large, but negative correlations with these two 
tests. Thus, high scores on the Form Board and Object Assembly 
reference tests were related to a linear decline in performance 
over the three levels of the construction facet. 

For the combined latency and correctness indices, larger 
increases in difficulty between one and two piece additions than 
between zero and one piece additions were, once again, related to 
Form Board and Object Assembly. However, magnitude of the simple 
difference between difficulty of one and two piece additions was 
related to verbal ability. 

Shifting to slower, more accurate performance between levels 
one and three was associated with verbal ability, while 
relatively larger shifts between one and two piece additions than 
between zero and one piece additions were negatively related to 
spatial ability. Thus, high spatial ability subjects showed the 
greatest change in speed-accuracy tradeoff between no addition 
and one addition items, rather than between one and two piece 
additions. 

Stimulus Complexity. In both the total time and construction 
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component analyses, the linear stimulus complexity contrast was 
related to verbal ability. 

The quadratic contrast was strongly related to verbal, verbal 
reasoning, and perceptual speed tests in the total time analysis, 
but more related to general ability in the construction component 
time analysis. 

Both linear and quadratic SCORE3 stimulus complexity 
contrasts were independent of the reference tests in the overall 
analysis, but weakly related to Copying and Figures in the 
analysis of the construction items. 

For the composite difficulty index, those who took much 
longer on constructions involving high complexity stimuli than on 
those using low complexity stimuli scored higher on a cluster of 
numerical or reasoning tests. A relatively larger increase in 
difficulty between medium and high complexity than between low 
and medium complexity was related to both spatial ability and 
memory span. 

Strong shifts to slower, more accurate responding between low 
and high stimulus complexity additions were associated with high 
scores on word fluency tests. 

Type of Addition. In both total time and construction 
component analyses, taking longer on left additions than right 
additions was related to good performance on Object Assembly and 
verbal tests. Further, those who missed many more left additions 
than right additions had high scores on the Visual Number Span 
test, while those whose performance was about the same on both 
reported strong visual imagery. 

For the combined latency and correctness indices, more errors 
and longer latencies on left additions were even more strongly 
associated with high scores on Visual Number Span and low scores 
on the Marks Vividness of Visual Imagery Questionnaire. Switching 
to slower, more accurate responding on left additions was 
indicative of high scores on Form Board and Object Assembly. 

Product Complexity. In both the total time and construction 
component time analyses, taking longer on high product complexity 
additions was strongly related to general ability. Time taken on 
low product complexity additions was also negatively correlated 
with Closure Speed. On the other hand, making relatively more 
errors on high complexity additions was weakly related to verbal 
ability. 

The combined latency and correctness difficulty indices were 
also strongly related to general ability in both analyses. The 
magnitude of the speed-accuracy tradeoff, on the other hand, was 
uncorrelated with the reference tests and only weakly related to 
an "internal" variable, total number of correct draw trials. 



Means Analysis. 

Total time increased nonlinearly while correctness decreased 
linearly over the rotation facet. The combined difficulty index, 
however, showed a perfectly linear increase. Further, there was 
a general shift to slower, more careful responding when rotation 
was required. 

Both rotation time and errors decreased linearly over the 
four blocks. The decrease was largest for errors on 180 degree 
rotations. 

Although the interaction between stimulus complexity and 
rotation was significant in both the component time and SCORES 
analyses, the interaction was different in each block and 
significant only in the first two blocks. 

Rotation component time increased linearly over the 
construction facet. However, interpretation was difficult since 
construction always preceded rotation. 

Contrast Correlations. 

M Contrast. Mean rotation time correlated significantly with 
number of errors on the Cards test. When analyzed by block, the 
correlation was significant only in the first block. Total 
number correct on rotation only items correlated significantly 
with spatial ability, particularly in blocks three and four. The 
highest correlations in block four were with Figures, Cards, and 
Paper Folding. Number of correct 180 degree rotations correlated 
slightly higher with spatial reference tests than number of 
correct 90 degree rotations. 

Rotation. Steep slopes were associated with Cards Wrong in 
the total time analysis (R = 0, 90, 180), and verbal ability and 
Digit Span in the component time analysis (R = 90, 180). A 
linear decline in correctness was related to high scores on 
Figures and Cards in the overall analysis (R = 0, 90, 180), and 
high scores on two visual memory tests in the analysis of the 
rotation only items (R = 90, 180). 

Stimulus Complexity. A linear increase in rotation time and 
a linear decrease in correctness over stimulus complexity were 
related to high scores on memory span tests, particularly WAIS 
Digit Span Backwards. Further, those who made relatively more 
errors on rotations of high complexity stimuli performed well on 
Form Board, Object Assembly, and Memory for Designs. 

Match 

Means Analysis. 

Match was the only nonsignificant main effect in the analysis 
of total time. This was the result of a substantial interaction 
between correctness of the test probe and correctness of the 
response. In particular, "yes" responses were faster on correct 
items and "no" responses faster on incorrect items. Those who 
performed well on the experimental task showed no increase in 
match time over the rotation facet and a small increase between 
construction and no construction items. Those who performed 
poorly showed a linear increase in match time over the rotation 
facet and a large increase in match time on construction items. 
Both correct and incorrect match times were consistently longer 
for high than for low product complexity stimuli. 

Correlations. 

For the match facet in the total time analysis, faster 
responses to positive test stimuli were related to high scores on 
the Form Board and Picture Completion tests. Faster responses to 
negative test stimuli were associated with more errors on the 
Cards test and higher scores on Visual Number Span. 

Mean match component time was the only component time that 
correlated significantly with spatial ability. These 
correlations were the same for all 216 items and the 24 no 
construction, no rotation items. However, mean match time was 
not significantly correlated with level scores on the 
experimental task. 
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CHAPTER V 

CONCLUSIONS 

1. The most important result of this experiment was the 
finding that individual differences in speed of doing simple 
spatial tasks are unrelated to verbal ability, spatial ability, 
or level scores on the experimental task. Speed of solving 
complex items was highly correlated with two task level scores 
and verbal reference tests. 

2. In general, individual differences in the patterns of 
correctness cell means over facets supported the construct 
validity of the facets while individual differences in the 
patterns of latency cell means over facets suggested more about 
the processes or strategies subjects were using to solve the 
items. Correlations between individual contrast scores and 
reference tests provided a powerful method for examining these 
relationships. 

3. Large interactions between facets implied that a simple 
additive mathematical model or process model that postulated 
independent stages would not describe these data well. 
Complexity or difficulty increased nonlinearly. 

4. Since time and correctness related differently to external 
variables, a complete analysis must examine both. A method 
designed to analyze their joint effects was proposed. Time and 
correctness were first independently standardized to zero mean 
and unit variance. The sum and difference of these two scores 
were then computed. The difference score represented a composite 
index of difficulty, while the sum score reflected the unique 
contribution of each variable, or variations in speed-accuracy 
tradeoff. Although new relationships sometimes emerged when 
contrast scores for these variables were correlated with 
reference constructs, it was concluded that correlational 
analyses were best performed on the original latency and 
correctness variables. The composite variables were, however, 
quite appropriate for the means analysis. 



5. The relationships between overall performance, performance 
on a particular item type, the patterns of performance over 
facets, and reference tests provided insights into the nature of 
the task and the nature of the reference constructs. Total 
number correct was strongly correlated with a cluster of complex 
spatial tests called Visualization in the factor analytic 
literature. Performance on the construction facet was 
consistently related to Form Board and Object Assembly, 
suggesting that mental construction may indeed be an important 
spatial skill. Similarly, performance over the rotation facet 
was related to performance on two tests (Cards and Figures) that 
usually define the factor called Spatial Relations. The Closure 
Speed factor was related to time spent on constructions that 
yielded a low complexity product. Other clusters of tests 
representing such diverse factors as Reasoning (or Number), Word 
Fluency, Visual Memory, and Memory Span were also related to 
particular patterns of performance on the task. 

6. Several important aspects of the spatial construction task 
were identified. In particular, mental construction was 
differentially affected by the number of stimulus elements to be 
combined, the complexity of these stimuli, the location (or type) 
of the addition, and, most significantly, the complexity of the 
to-be-constructed stimulus. Further, large increases in latency 
over construction, stimulus complexity, and type of addition were 
related to verbal ability and verbal solution strategies. Rapid 
construction on low product complexity items was related to 
Closure Speed, while good overall performance was related to Form 
Board and Object Assembly. Finally, differential correlations 
with Hidden Figures and the Marks Vividness of Visual Imagery 
Questionnaire suggested that holistic and analytic spatial 
construction strategies must be distinguished. 

7. Analysis of the rotation items confirmed Cooper and 
Podgorny's (1976) finding that rate of rotation was independent 
of stimulus complexity. Examination of individual differences 
added that steep reaction time slopes for rotation were related 
to verbal ability, short term verbal memory, and verbal 
strategies. Further, large increases in rotation time for 
complex stimuli were also associated with verbal memory span 
tests. 

8. Some subjects attempted to solve some items on this task 
using nonspatial strategies. However, since total SCORES were 
highly correlated with performance on other complex spatial 
tests, these tests are also suspect. Those who have studied 
retrospective reports of solution strategies on spatial tests 
have reached the same conclusion (see Lohman, 1979a, for a 
review). The problem is particularly acute on complex spatial 
tasks. Some subjects solve easy items using spatial strategies, 
but find that these strategies break down as item difficulty 
increases. They then resort to verbal-analytic strategies. 
Strategies also change with practice. Attempting to eliminate 
individual differences in solution strategy by keeping the task 
simple will not do, as simple tests are also solved in different 
ways by different subjects. Further, as this study has shown, 
individual differences in speed of solving simple spatial tasks 
are largely independent of more generalizable level scores on 
complex tasks of the same type. Future research on spatial 
ability must attend to these important intra-subject and 
inter-subject differences in solution strategy. One way to do this is 
through experiments that are designed to yield qualitatively 
different patterns of scores for different solution strategies. 
Proportional subject sampling, selective item administration, and 
modeling performance at the level of item types within 
individuals are also recommended as part of the proposed research 
strategy. 
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